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Abstract 

We consider a model of a population of fixed size N undergoing selection. Each individual 
acquires beneficial mutations at rate and each beneficial mutation increases the individ¬ 
ual’s fitness by sat. Each individual dies at rate one, and when a death occurs, an individual 
is chosen with probability proportional to the individual’s fitness to give birth. Under certain 
conditions on the parameters and sn, we obtain rigorous results for the rate at which 
mutations accumulate in the population and the distribution of the fitnesses of individuals in 
the population at a given time. Our results confirm predictions of Desai and Fisher (2007). 


1 Introduction 

We consider the following model of a population undergoing selection. We assume there are 
exactly N individuals in the population at all times. Each individual independently acquires 
mutations at times of a Poisson process with rate /Uat, and all mutations are assumed to be 
beneficial. Each individual is assigned a fitness, which depends on how many mutations the 
individual has acquired relative to the mean of the population. More precisely, let Xj{t) be the 
number of individuals with j mutations at time t, and let 

- OO 

j=0 

be the average number of mutations for the N individuals in the population at time t. Then the 
fitness of an individual with j mutations at time t is 

max{0,1-b SAr(j - M(p)}. (1.1) 

Each individual independently lives for a time which is exponentially distributed with mean one, 
then dies and gets replaced by a new individual. The parent of the new individual is chosen at 
random from the population, and the probability that a particular individual is chosen as the 
parent is proportional to that individual’s fitness. The new individual inherits all of its parent’s 
mutations. Note that this model includes two parameters: the mutation rate /tat and the selection 
parameter sn- 
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This model is of interest mostly because it is essentially the simplest possible model that 
allows for repeated beneficial mutations. The model has appeared previously in the literature; 
see, for example, An alternative to m, which was considered, for example, in [2l [8l fT9]. 

is to assign a fitness of (1 + s^y to an individual with j mutations. However, assumption A3 
below will ensure that for the range of parameters that we will consider, sjv(j — M{t)) is small 
and therefore the approximation 1 + sjsfU — M{t)) ps (1 + is valid. Consequently, 

the distinction between these two models is not important for our purposes. A limitation to our 
model is that the selective advantage sn is assumed to be the same for every beneficial mutation. 
Some authors have considered models in which the selective advantage resulting from a mutation 
is random (see [HIIIKITIIIHIEH]), but we do not consider this complication here. 

Here we will be interested in determining how rapidly the population acquires beneficial 
mutations, that is, how fast M{t) grows as a function of t. This growth rate is sometimes called 
the rate of adaptation or the speed of evolution. Also, we will be interested in understanding the 
distribution of the fitnesses of individuals in the population at a given time. 


1.1 Previous work 


The behavior of the population in this model can vary considerably depending on the values of 
the parameters fijsf and sat. The simplest case to handle is when the mutation rate /tat is small 
enough that there is only one beneficial mutation in the population at a time. This occurs, for 
example, when sat = s > 0 is a fixed constant and limAr^oo A*iv(A^log A") = 0. In this case, there 
is approximately an exponentially distributed waiting time until there is a so-called selective 
sweep, in which a beneficial mutation appears on one individual and then spreads to the entire 
population, followed by another exponentially distributed waiting time until another selective 
sweep occurs, and so on. See, for example. Chapter 6 of [6] for details. However, the process 
becomes much more complicated as soon as mutations occur rapidly enough that there can be 
more than one beneficial mutation in the population at a time. 

Another case that has been studied in detail is when Nnjy —)• a G (0,oo) and Nsjsf ^ 7 G 
(0,oo) as A —>■ 00 . That is, the mutation rate hn and the selection parameter sn are both of 
the order 1/A. In this case, one can describe the process using diffusion theory. For a summary 
of results in this direction, see sections 7.2 and 8.1 of [6] and chapter 10 of [9]. 

An important paper which establishes rigorous results is the work of Durrett and Mayberry 
[8] , who were motivated by cancer modeling. They considered the variation of the model in which 
the fitness of an individual with j mutations is given by (1 -|- s)-^, where s is a fixed constant not 
depending on A. They also assumed that ^a^ ~ A““, where 0 < a < 1. They showed that if 
Tj = min{t : Xj{t) > 1} is the first time when an individual gets j mutations, then 


sT,' 


log(l//XAr) 


>p tj 


for a certain deterministic sequence of constants where -^p denotes convergence in prob¬ 

ability as A — 00 . They also obtained more precise results describing how the number of type j 
individuals evolves over time. 

Yu, Etheridge, and Cuthbertson [24] considered very fast mutation rates, where > 0 

and Sat = s > 0 for all A. That is, neither the mutation rate nor the selection parameter depends 
on A. The model they considered is slightly different from the one presented here in that an 
individual’s fitness affects its death rate as well as its birth rate. They observed that the process 
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that keeps track of the differences between the fitness of the individuals and the mean fitness of 
the population has a stationary distribution. They proved that if the process starts from this 
stationary distribution, then for all <5 > 0, we have 


E[M{t) - M{t - 1)] > (log A^) 


l-(5 


if N is sufficiently large, thus establishing a lower bound of (log on the rate of adaptation. 

Kelly |14] considered the same model and obtained a corresponding upper bound by showing 
that if at time zero there are no mutations in the population, then 

E[M{t)] ^ ClogN 


(log log A^)2 

for t > log log A^, where C is a positive constant. However, up to now, the precise asymptotic 
rate of adaptation has not been calculated rigorously in this case. 

Although there are only a few rigorous results available for this model, there has been a 
considerable amount of previous nonrigorous work on this model and closely related models, 
mostly appearing in the Biology literature. Of particular relevance for the present paper is the 
work of Desai and Fisher [3], who carried out a precise and detailed analysis of this model. They 
found, under certain conditions on the parameters sn and fiN, that the difference in the number 
of mutations between the fittest individual in the population and a typical individual in the 
population is approximately 

2log{NsN) 2 ) 

log(sAr//iAr) 

and that in the long-run, the number of mutations carried by a typical individual in the population 
increases at the rate of approximately 


2sn logjNsN) 

[iog{sN/mW 


(1.3) 


per unit time. See the discussion around equations (4) and (5) on p. 1765 of [3] for a brief 
explanation, and see the discussion around (40) and (41) on p. 1774 of [3] for a more detailed 
analysis. See also Brunet, Rouzine, and Wilke [3] for further analysis of these results. The 
heuristic arguments in [3] are discussed in more detail in section [2] below, and are largely the 
basis for the rigorous results proved in this paper. 

Rouzine, Brunet, and Wilke m studied the same problem using a different approach, building 
on earlier work of Rouzine, Wakeley, and Coffin [20], and estimated the rate of increase in the 
number of mutations carried by a typical individual in the population to be approximately 


2sn ^Og{Ny/SNfJ-N) 


[\og{{sN/^iN) log{N 


2 ’ 


(1.4) 


which will match ()1.3p asymptotically as long as the extra factors inside the logarithms can be 
ignored. See equation (53) in [TO], and see section A.l of m for a discussion of the assumptions 
required for (II.4p to be valid. 

In addition to obtaining the estimates ()1.3p and (II.4p on the speed of evolution, these and 
other authors have considered the distribution of fitnesses of individuals in the population at a 
given time, coming to the conclusion that this distribution should be approximately Gaussian. 
See, for example, the discussion at the top of p. 1775 in |3|, the mathematical appendix in 
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and the discussion around (11) of [19]. Other heuristic arguments for why the distribution of 
fitnesses should be approximately Gaussian are given in section 3 of [23| and in the supporting 
information to |2]. Because the mean of this Gaussian distribution is increasing in time as the 
population evolves, the evolution of the fitness distribution in the population can be modeled 
as a Gaussian traveling wave. This point of view is emphasized in |2| and can be traced back 
at least to [22]. It should be noted that Durrett and Mayberry [8] rigorously obtained traveling 
wave behavior in their model. However, for the low mutation rates that they considered, the 
number of values of j for which Xj{t) > 0 does not tend to infinity as —>■ oo. Consequently, 
they did not observe a traveling wave with a Gaussian shape, and indeed the Gaussian traveling 
wave picture has not been established rigorously for any range of parameter values. 

The goal of this paper is to carry out a detailed, mathematically rigorous analysis of the model 
described above. Under certain conditions on sn and /tat, we are able to confirm several of the 
most important nonrigorous predictions about the model. We obtain rigorous results concerning 
the speed of evolution and the distribution of fitnesses of individuals in the population at a given 
time. We present our assumptions in section 11.21 and our main results in section 11.31 In section 
[21 we explain the heuristics behind the results, most of which are adapted from the previous 
nonrigorous work mentioned above. The rest of the paper is devoted to proving the main results. 

This is the first in a series of two papers devoted to the study of this model. In the follow-up 
paper |2I], we show that the genealogy of the population can be described by a process called the 
Bolthausen-Sznitman coalescent, confirming predictions of Desai, Walczak, and Fisher [5| and 
Neher and Hallatschek m- The paper [2T] uses extensively the results and techniques developed 
here. 


1.2 Assumptions on the parameters 

For deterministic sequences {xn)n=i (yA^)?^=i depending on the population size N, we write 
XN ~ UN if ^^xcim^ooXn/un = 1 - We write xn < Vn if ^^xcim^ooXn/un = 0 and xn > Un if 

liuiN^ooXN/yN = OO. 

For our main results, we will need the following assumptions on the parameters sjv and /iTv: 

log N 

Al: We have lim -— -;-——-—;—- = oo. 

N^oo log(SAr//iAr) log(l/SAr) 


A2: We have 


log A 

N^oo [log(sAr//rAr)]2 


log 


/ logiv \ 

\log{sN/m)J 


= 0 . 


A3: We have lim ^ = q. 

N^oo log{SN/fJ-N) 


The biological meaning of these assumptions, and the reason why they are needed for the main 
results, will be described later in section 1231 Here we mention some of their consequences. 

Dividing A3 by Al, we see that the assumptions imply that limAr^oo sat log(l/sAr) = 0 and 
therefore 

lim sn = 0 . ( 1 - 5 ) 

N^OO 


This result and Al imply that 


log A 


Af^oo log{sN/^ n) 


= oo, 


( 1 . 6 ) 
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and combining this observation with A2 gives 


logA^ 

N^co [log(sAr//iAr)]" 


= 0 . 


(1.7) 


Dividing ([121) by Al, we get limAr^.oolog(l/sAr)/log(sAr/^Ar) =0. Thus, log(l//iAr) > log(l/sAr), 
which means that for all a > 0, we have 


^J‘N < S%. (1.8) 

That is, the mutation rate tends to zero faster than any power of sat. Another consequence of 
the fact that limAr^oo ^og{l/si\f)/log{si\f /= 0 is that log{sN /~ log(l//iAr)- In particular, 
()1.6p implies that logAl S> log(l//iA''), which means that for all a > 0, we have 

m (1.9) 

That is, the mutation rate tends to zero more slowly than any power of 1/N. Also, note that 
because log(sAr//UAf) ~ log(l//iAr), the expression log(s7v/l^A'') could be replaced by log(l//UAf) in 
any of the conditions Al, A2, and A3. We state the conditions in their current form because 
log{sN/fJ-N) arises more naturally, as we will see later. We will always assume N is large enough 
that /iAf < Sn, so log(sAr//iAf) > 0. 

To illustrate how these assumptions can be satisfied, we observe that if 1/2 < 6 < 1 and 
0 < a < 1 — 6, and if for all N we have 

and 


then assumptions A1-A3 hold 


g-(log7V)“ < < 




1.3 Main results 


Let 


UN = -log 

SN 



( 1 . 10 ) 


We will see later that, as was observed in [5], the quantity a at is approximately the amount of 
time between when the first individual with j mutations appears and when individuals in the 
population have j mutations on average. This is the time scale on which we will study the process. 
Also, define 


kN 


log Al 

log(sAr/AtAr)’ 


( 1 . 11 ) 


which we will see is the natural scale on which to consider the number of mutations. For t > 0, 
let 

Q{t) = max{j : Xj{t) > 0} — M(t) (1-12) 


be the difference between the number of mutations carried by the fittest individual in the popu¬ 
lation and the mean number of mutations in the population. Our first theorem is an asymptotic 
result for this quantity. Here and throughout the paper, the notation —denotes convergence in 
probability as N ^ oo. 


5 






Theorem 1.1. Assume A1-A3 hold, 
such that 



There is a unique bounded function q : [0, oo) —[0, oo) 


e* if 0 <t < 1 

Jt-i di'^) du ift> 1. 


(1.13) 


If S is a compact subset of (0,1) U (1, oo), then 


sup 

tes 


QioNt) 


In 




0 . 


(1.14) 


Furthermore, we have 


lim q{t) = 2. 

>00 


Note that Theorem 11.11 implies that for large t, we have 


(1.15) 


Q{aNt) 


2 log N 

\og{sN/hNy 


(1.16) 


which is consistent with Desai and Fisher’s prediction (II.2p because | log sat| <C log when Al- 
A3 hold. Note also that the function q is discontinuous at 1, which is why we can not expect 
uniform convergence to hold over intervals containing 1. 

The next result is our main theorem concerning the speed of evolution. It shows how the 
mean number of mutations in the population changes over time. 


Theorem 1.2. Assume A1-A3 hold. Let m : [0,oo) be the function defined by 


r 0 ifo<t<i 

I 1 du ift> 1, 


(1.17) 


where q is the function defined in I11.13\) . Then, if S is a compact subset of [0,1) U (l,oo), we 
have 

0. (1.18) 

Note that the function m is discontinuous at 1, so Theorem 11.21 implies that the average 
number of mutations of individuals in the population stays close to zero until time un, then 
rapidly increases to approximately /cat. To see the long-run rate at which the population acquires 
beneficial mutations, note that (|1.15p implies that 


sup 

t^s 


M (oTvt) 


kN 


— m{t) 


lim 

t^OO 


m{t) 

t 


= 2 . 


(1.19) 


Therefore, for large t, 

M{aNt) _ m{t)kN _ 2 sn log N 

ONt ONt [log(sAr/liAf)]^’ 

The right-hand side of (ll.2nh can be viewed as the rate of adaptation, or the rate per unit time at 
which new mutations take hold in the population. Because |logSAr| logA^ and log(l/^Ar) ^ 
logA^ when A1-A3 hold, as can be seen from (jl.Sp and dOP, and log log <C log{sN/TN) by 
(lEZl), this result is consistent with the predictions (|1.3p and (11.41) . 
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Remark 1.3. The functions q and m have a renewal theory interpretation, which helps to 
explain (11.151) and (11.191) . Consider a renewal process in which the distribution of the time 
between renewals is uniform on (0,1). Let N{t) be the number of renewals by time t, and let 
U{t) = E[N{t)]. The renewal equation gives 

U{t) = {tAl)+ [ U{x)dx. 

Let U' denote the right derivative oiU. If 0 < t < 1, then U'{t) = 1 + U{t), and since U{0) = 0, it 
follows that U{t) = e*—1 and thus U'{t) = e*. Iff > 1, then U'{t) = U{t)—U{t—1) = U'{u)du. 
It follows that U' satisfies (jl.l3l) . so U'{t) = q{t) for all t. Also, for t > 1, 

l-t-i 

m{t) = 1 + / U'{u) du = 1 + U{t — 1). 

Jo 

For large t, because the uniform distribution on (0,1) has mean 1/2, we have U{t) ~ 2{t — 1), 
which explains (I1.19p . 

Next we state our main result for the distribution of fitnesses of individuals in the population 
at a given time. Let tq = 0 and for j G N, let 

Tj = inf > — 1, (1.21) 

I d-N J 

which we will see later is approximately the time when some individuals with j — 1 mutations 
start to acquire a jth mutation. Also, let 


7j = Tj + aAr. (1-22) 

We will see later that most individuals in the population between times 7 j and 7 j+i will have j 
mutations. For t > 1, let 

j{t) = max{j : 7 ^- < aNt}. (1.23) 

On the event that 7 j(t) < 7 j(t)+i < 00 , which we will see later has probability close to one, let 
d{t) be the number in [— 1 / 2 , 1 / 2 ) such that 

ant = (1/2 - d(f)) 7 j(t) + (1/2 + fi(t))7j7)+i- (1-24) 

Otherwise, let d{t) = 0. 

Theorem 1.4. Assume A1-A3 hold. For each 7 > 0 and t G (1,2) U (2, 00 ), there exists 6 > 0 
sueh that 

, f Xj(t)+£{aNt) \ [log(sAf/ 7 Af)]^(£^ - 2 £d(t)) ^ r]f[\og{sN/hN)? 

V Xj(t){aNt) J 2q{t-l)logN ~ logiV 

for all £ G [—6kN, OkN] O Z | = 1. 


lim P 

N^OO 


Furthermore, 6 = 0{rj,t) can he chosen as a function of rj and t such that for each fixed 7 > 0 
and a > 2, we have 

inf 6{r],t) > 0. (1-25) 

tS[a,oo) 
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Theorem 11.41 compares the number of individuals with j{t) mutations to the number of in¬ 
dividuals with j{t) + i mutations at time ai\ft. To see why this result is consistent with the 
conjecture from section 11.11 that the distribution of fitnesses is Gaussian, note that if Z is a 
random variable having a Gaussian distribution with mean x + d and variance cr^, and / is the 
probability density function of Z, then 


log 


V m ) 





e - 2id 

2ct2 


Therefore, the result of Theorem II .41 suggests that, in some sense, the distribution of the fitnesses 
of individuals in the population at time a^t is approximately Gaussian with a mean of j{t) + d{t) 
and a variance of 

2 _ g(t- l)logA^ 

[log(s^//i^)]2- 

It should be noted, however, that dLH) implies that limAr^oo (t) = 0. Gonsequently, the 
distribution of fitnesses of individuals in the population at time oatI does not actually converge 
to a Gaussian distribution as —>■ oo. Rather, the fraction of individuals in the population 
with exactly j{t) mutations will be close to 1, unless \d{t)\ is very close to 1/2. Nevertheless, the 
appearance of 1“^ — 2£d{t) in Theorem oi demonstrates Gaussian-like tail behavior. 


1.4 Notation 

We collect here for the convenience of the reader some of the most important notation used 
throughout the paper. Because most of this notation has not yet been introduced, the reader is 
encouraged to skip this section for now and refer back to it as needed. 

Oat (1/s) log(s/;u), natural time scale for the process 

b Defined in (|3.14l) . used to determine which mutations are “early” 

Bj{t) Birth rate of a type j individual at time t, see (|4.ip 

Dj{t) Death rate of a type j individual at time t, see (14.2|] 

Natural filtration of the process 

Gji't) s(j “ ^{t)) — /U, growth rate of type j population at time t 
j{t) max{j : 'jj < a^t}, corresponds to most common type at time t 

J 3k]\fT + k* + 1, bound on number of types likely to appear by time a^T 

ki\f log N/log(s///), natural scale for the number of mutations 

k^,k^ numbers slightly smaller and larger than kj\f, see (13.2p and (13.31) 
k* largest integer less than k^ 

K L^Ar/4j 

L \l7kN] 

m{t) Scaling limit of > 0), defined in (jl.l7p 

M(t) Mean number of mutations in the population at time t 

M(t) Approximation to mean number of mutations at time t, defined in (|7.16l) 

N Population size 

Q{t) Difference in number of mutations between fittest individual and average 

q{t) Scaling limit of > 0), defined in (jl.l3p 

Qj Approximately the value of Q{Tj), see (|3.15l) 

R{t) Number of tj between t — ajy and t, see (I3.23h 
s = sn Selective advantage resulting from a mutation 
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Sj{t) Number of individuals with j or fewer mutations 

t* Time before which individuals of types up to kjy appear, defined in (|3.6p 

T Large positive number; the process is studied up to time awT 

Xj{t) Approximation to number of individuals with j mutations at time t for t <t* 
Xj{t) Number of individuals with j mutations at time t 

Xj lit) Number of type j individuals at time t descended from mutations before C-i 

Xj,2{t) Xjit) - Xj,iit) 

Zjit) Martingale associated with evolution of type j individuals, see Proposition 14.11 

7 j Tj + otv , approximately when most individuals have acquired j mutations 

6 Small positive number, bounds error in various approximations 

e Small positive number, bounds probability that conclusions of results fail 

( First time that the conclusion of Proposition 13.2113.3113.51 or 13.61 fails 

= fiN Mutation rate for each individual 

Time before which type j mutations are early, see (j3.16p 
Tj inf{t ; Xj-iit) > s/n}, approximately when type j individuals appear 

Tj Tj+aN/{4TkN) 

2 Heuristics 

In this section, we discuss the key ideas behind the main results in the paper. The goal is to 
explain to the reader, in just a few pages of calculations, why the main results are true. Most of 
these heuristics have already appeared in the Biology literature, particularly in the work of Desai 
and Fisher [1] . We postpone rigorous proofs of the results, and justification for the approximations 
used, until later sections, and in this section we assign no precise meaning to the approximation 
symbol ~. Here and throughout the rest of the paper, to lighten notation we write /i and s in 
place of /iAT and sn respectively, even though these parameters depend on the population size N. 

2.1 The initial stage 

Consider first the initial stage of the process, when the average number of mutations in the 
population is close to zero. For times t in this range, we have X^it) ~ N and M(t) ~ 0. During 
this stage, we can approximate the process by a multitype branching process in which a type j 
individual dies at rate 1, gives birth to another type j individual at rate 1 + sj, and mutates to 
type J + 1 at rate This means that the total rate at which type j + 1 individuals appear due 
to mutations is /iAj_i(t), and if such a mutation appears at time u < t, the expected number 
of descendants of this individual in the population at time t is ~ where the 

approximation is valid because // is much smaller than s. This leads to the approximation 

E[Xi{t)]^ f fiE[Xoit)]e^^^-^Uu ^ f du = ~ . (2.1) 

Jo Jo s 

Then an inductive argument gives 

E[X,it)] « [\E[X,_iiu)W<^--^ du ^ - 1)T (2.2) 

Jo ■s J- 

The approximation (12.2p only holds when the mean number of mutations is close to zero, 
which can be true only when Xiit) is much smaller than N. From (12.ip . we see that Ali(t) will 
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be of order N when e®* is comparable to s/fj,, which happens near time 



= aN- 


Before time ajsf, the average number of mutations in the population will be close to zero, and the 
approximation (I2.2p will be valid. 

For the approximation (j2.2p to be useful for understanding the evolution of the number of type 
j individuals, we need to know that Xj(t) ss E[Xj(t)]. We will calculate, using a second moment 
argument, that this approximation holds for small times t when j < k^- This is true essentially 
because, for j < kM, type j individuals appear in the population very quickly. For larger values 
of j, however, it is not true that Xj(t) ~ E[Xj(t)]. Rather, the expectation is dominated by 
rare events in which an individual acquires a jth mutation much earlier than usual, causing the 
number of type j individuals at later times to be unusually large. Therefore, for j > kj\f, we can 
not approximate Xj{t) by its expectation, and we need a different technique to understand the 
process {Xj{t),t > 0). 


2.2 Evolution of the number of type j individuals 

We now consider the evolution of type j individuals when j > kjsf. The key idea is to break the 
process into two stages: an initial stage in which the type j population becomes established as 
a result of mutations experienced by type j — 1 individuals, and a second stage in which these 
mutations are no longer important and the type j population evolves essentially in a deterministic 
way. This idea has been used in previous work on this model, and in particular many of the 
calculations in this section strongly resemble those in [1]. Recall that Tj is the first time when 
there are at least s/fj, individuals of type j — 1 in the population. We will show using a first moment 
argument that with high probability, no type j individuals will appear before time Tj. The 
type j population becomes established during the interval [Tj,Tj+i], then evolves approximately 
deterministically after time Tj+i. 

After time Tj+i, we will see that mutations from type j —1 to type j no longer have a significant 
impact on the size of the type j population. Consequently, at a time u > r^+i, the number of 
type j individuals will be growing approximately deterministically at the rate s{j — M(u)), which 
is the size of the selective advantage that a type j individual has over an individual of average 
fitness. That is, for t > Tj+i, we have 

. S s(j—M(u))du 

Xj{t) PS ^ ” 

k- 

Consider next what happens between times Tj and Tj+i, when the type j population gets 
established. We can use (|2.3I) to approximate the number of type j — 1 individuals shortly after 
time Tj. As long as no type j individual appears before time Tj, we have (j — 1) — M{Tj) = Q{Tj), 
so (12.3p suggests the approximation 


(2.3) 


Xj-i{t) 


k 


(2.4) 


As long as the average fitness of the population does not change much shortly after time Tj, 
a new type j individual that appears because of a mutation at time u will have on average 
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gs{Q{Tj)+i)(t u) descendants at time t. Thus, we have the approximation 


X, 




,sQ(Tj){u-Tj) _ gS(Q(Tj)+l){t-n) 


J Tn 


^s{Q{Tj) + l){t-Tj) 


(2.5) 


where the last approximation requires t — Tj S> 1/s. Therefore, Tj+i should occur approximately 
when the expression in (j2.5h equals s/^, which leads to 


1 , f aN 

^j+i - s{Q{Tj) + 1) V/r j ^ Q{Tj)' 

To estimate Q{Tj), note that (12.41) and (12.5p lead to 

Xj{t) _ e^(Q(^j)+i)(t-^j) _ ^ 


which equals one when 


t-Tj = - log 
s 



= aN- 


That is, the number of type j individuals surpasses the number of type j — 1 individuals approx¬ 
imately otv time units after type j individuals first appear. Around that time, there will be more 
type j individuals than individuals of any other type, and the mean number of mutations in the 
population will be approximately j. It follows that M{Tj) will be approximately the type that 
first appeared roughly ajy time units in the past, and Q(Tj) will be approximately the number 
of new types that have appeared in the last time units. Because the rate per unit time at 
which new types are appearing can be approximated by the reciprocal of the expression in (12.61) . 
we obtain for t > 1 the approximation 


r°-Nt Q{u) 

Qia^t) ~ / - - du= QidNv) dv. 

Jt—l 


(2.7) 


For t < 1, we know from the discussion in section IQ that M^a^t) ~ 0, so Q{aj^t) is ap¬ 
proximately the number of types that have originated before time a^t. Since we know from 
the discussion in section ED that types appear at very small times, we have for t < 1 the 
approximation 

r°-Nt Q{^\ r* 

Q^aiyt) ^ k]\f + / - - du = k^ + / Q{a]sfv)dv, 

Jo O-N Jo 

which implies Q{aMt) ~ kMe^- This result and (|2.7p lead to the approximation to Q{aj^t) in 
Theorem 0 The result (jl.lSp then follows from the renewal theory argument outlined in 
Remark oi 

To understand Theorem ll.2l recall again that M(aArt) ~ 0 for t < 1. For t > 1, we know from 
the discussion in the previous paragraph that M{a^t) is approximately the number of types that 
appear before time a^it — 1). Because k^ types appear near time zero and the rate at which 
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new types appear can be approximated by the reciprocal of the expression in (|2.6p . we get for 
t > 1 the approximation 

/•aiv(i-l) rt-l 

M{aj\ft) ^kj\f+ - - du = kjy + / Q{aNv) dv, 

Jo O-N Jo 

which leads to Theorem ll.2l To obtain the result of Theorem 1 1.41 we use the approximation (|2.3p 
to compare and We refer the reader to the proof of Theorem 11.41 in subsection 19.31 

for the details of this calculation. 

Although the main ideas discussed in this section come from [4], it has been assumed in most 
previous work on this model such as mil] that the population is already in equilibrium. Then 
one can argue that this equilibrium is only possible when (|1.16l) and (|1.20l) hold. One of the 
contributions of the present work is to show how the process arrives at such a state, beginning 
from a population in which no mutations are present. 

2.3 Meaning of the assnmptions 

We briefly discuss here the assumptions required for these results to be valid. Note that (jl.6p is 
equivalent to the condition 

lim kN = oo. 

N^oo 

Since Q{aNt) is of the order k^, assumption Al implies that the number of different types in 
the population at a given time tends to infinity as N —>■ oo. This condition is not satisfied in the 
parameter regime considered by Durrett and Mayberry [8]. Assumption Al also ensures sat is 
large enough for mutations to take hold in the population in the manner described above. 

For the heuristics described in section [22] to be valid, the type j population must be growing 
approximately exponentially after time t^+i, which will happen as long as additional mutations 
from type j — 1 to type j are no longer having a significant impact on the population size. The 
contribution to the type j population from mutations at different times can be seen from the 
integral in the second line of (j2.5|) . The primary contribution to this integral comes when u is 
comparable to 1/s. Consequently, we need Tj+i — Tj 3> 1/s for the number of type j individuals 
to be growing exponentially after time r^+i. In view of (12.6h and the fact that Q{Tj) is the same 
order of magnitude as /cat, this is equivalent to the condition 

-^log(-) »-, 

SkN \fj-J s 

which is equivalent to (1121). Thus, the role of assumption A2 is to ensure that the mutation 
rate ^ is slow enough that we can ignore mutations from type j — 1 to type j after time Tj+i. 
For technical reasons, assumption A2 is slightly stronger than ()1.7|) . but we conjecture that the 
main results of the paper are still true if assumption A2 is replaced by (11.711 . It remains an open 
question to understand how the process evolves if the mutation rates are fast enough that (II.7p 
fails to hold. 

Assumption A3 is equivalent to the condition 

lim sk^ = 0. (2.8) 

N^OO 

Because the difference in fitness between the fittest individual and an individual of average fitness 
is of the order sk]\f, assumption A3 implies that we are not considering very strong selection. 
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3 Structure of the Proofs 


In this section, we state some intermediate results that will lead to the proofs of the main results. 
Some of these intermediate results may also be of independent interest, as they provide some 
insight into how the number of individuals with j mutations evolves over time. Throughout the 
section, we will fix three positive numbers: e, 5, and T. We will use e G (0,1) for the maximum 
allowable probability of some “bad” event and 

0 < (5 < — (3.1) 

100 ^ ^ 

for the maximum allowable error in certain approximations. We will study the process up to 
time ajyT, where T > 1. Throughout the paper, we will introduce some positive constants Cn- 
These constants may depend on the three parameters e, 6, and T, even though this dependence 
will not be specifically mentioned each time. 

On numerous occasions throughout the paper, we will assert that a statement holds “for 
sufficiently large N” . This means that there exists a positive integer Nq, depending on e, 5, and 
T, such that the statement in question holds for N > Nq. Often the statement in question will 
somehow involve the evolution of type j individuals, where j could take values in a certain range, 
typically 0<j<k* or k* + l<j<J, where k* and J are defined below. The statement may 
also involve the time t, which may be permitted to take values in a certain range. In such cases, 
the value of Nq may not depend on j or t. That is, the same Nq must work for all j and t in the 
indicated ranges. 


3.1 The process until time t* 


We begin by considering the initial stage of the process. Recall from subsection 12.11 that for 
j < k]\f = log N/ log(s//i), we expect individuals of type j to appear in the population very early, 
and we expect the number of type j individuals to be well approximated by the right-hand side 
of (j2.2p . To state a precise result, define 


and 


log N log N / log N \ 
^ log(s/|u) log(s/^)2 Vlog(s//r)y 

+ _ log IV 21ogiV / logiV \ 

^ log(s/^) log{s/ny 


Also, let 


k* = max{j G N : j < k~j^} 


be the largest integer less than k%. Assumption A2 implies that 


lim {kt - k^) = 0, 


N^oo 


(3.2) 

(3.3) 

(3.4) 

(3.5) 


so for sufficiently large N, the number of integers j such that k^ < j < k~l^ must be either zero 
or one. Define the time 

,* _ ( (I/'S) log /cat if there exists an integer j such that k~^ < j < /c/) , , 

\ (2/s) log few otherwise 

The following proposition, which we prove in section [5l describes how the process evolves before 
time t*. 
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(3.7) 


Proposition 3.1. For all nonnegative integers j and all t >0, define 

- ly 


Xj{t) = 


sijl 


Then there exist positive constants Ci and C 2 sueh that for suffieiently large N, the following 
four statements all hold with probability at least 1 — e/2: 


1. For all j < k^, we have 

2. For all j G write 


sup \Xj{t) — Xj{t)\ < 6xj{t* 
te[o,t*] 


(3.8) 


log N bj log N 
^ log(s//i) log(s/^)2 

where —l<bj<2, and let dj = max{0,6j}. Then 


logN 

log(s/^) 


Cikj/^Xjit*) < Xj{t*) < C2kj/^Xj{t*). 


(3.9) 


(3.10) 


3. For all t G [0,t*], we have Xk*{t) < s/p,. 

4 . For all j > k~y and t G we have Xj{t) = 0. 

3.2 Evolution of type j individuals 

In this subsection, we consider how the population evolves after time t*. Recall the definitions of 


and from (jl.21l) and (jl.22p . For nonnegative integers j and f > 0, define also 

Gjit) = sij -M{t)) - p, 


(3.11) 


which we can interpret as the rate of growth for the number of type j individuals at time t. We 
will also define the integers 

K=[kN/4.\, L=\nkN]. 

The next proposition describes the evolution, after time t*, of the number of individuals with 
k* or fewer mutations. The first part of the proposition controls the evolution of the type j 
individuals after time t*. The second and third parts provide upper bounds on the number of 
type j individuals as these individuals get close to extinction. 

Proposition 3.2. For suffieiently large N the following statements all hold with probability at 
least 1 — e: 

1. For all j < k* and t G [t*,3k*+K\, we have 

(1 — 6)Xj{t*) exp ^ J Gj{v) dv'^ < Xj{t) < (1 + S)Xj{t*) exp ^ J Gj{v) dv'^ . (3.12) 

2. For all j < k* and t G [ 7 fc«_|_i^, a^vT], we have 


Xj{f) < k%Xj{t*)eyiY) 


Gj{v) dv . 


(3.13) 
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3. On the event that ')k*+L < cl^T, we have Xj{t) = 0 for all j < k* and t > 7 fc*+L. 

We next consider the individuals of type j for j > k* + 1. By part 4 of Proposition 13.11 
individuals of these types typically do not appear until after time t *, so we need to consider how 
these types originate. Define the positive number 


For j > k* + 1, let 


b = log 


24000 r\ 

6 ^e )' 


(3.14) 


j 3 ~ if o-N ~ 2a]\f/k]\f < Tj < a^v + 2a]\f/k]\f 

\ j — M[Tj) otherwise 


and 

Qj = max{l,q*}. (3.15) 

Then dehne 

{, = max|r„r, + Tl„g(^T)+±|, (3,16) 

When an individual with j — 1 mutations gets an additional mutation, we call this a type j 
mutation. Each type j individual in the population at time t has an ancestor that got a type j 
mutation at some earlier time. We call the individual an early type j individual if this type j 
mutation happened at or before time Let Xj^i{t) be the number of early type j individuals 
at time t, and let Xj^ 2 {t) be the number of other type j individuals at time t. This means, of 
course, that 

X,(t) =X,-i(t) + X,-2(t). 


Also, define the time 


* I 

D = D + 


aN 

ATkN' 


(3.17) 


The result below describes the evolution of the type j individuals for j > k* + 1. The first 
two parts of the proposition concern the evolution of the type j individuals up to time Tj+i and 
require classifying the type j individuals as being early or not early. The remaining three parts 
parallel the three parts of Proposition 13.21 


Proposition 3.3. There exists a positive constant C 3 such that for sufficiently large N, the 
following statements all hold with probability at least 1 — e: 


1. For all j > k* + 1 and all t € [rt, Tj+i] n [0, ajyT], we have 


Xj,i{t) < Csexp 


Gj{v) dv 


(3.18) 


Also, Xj^iif) < s/2pL for all t < t* A a^T, and no early type j individual acquires a type 
j + 1 mutation until after time Tj+i A otvE. 


2. For all j > k* + 1 and all t G [r*, Tj+i] n [0, ajsfT], we have 


(1 — 4(5) exp ^ J Gj{v) dv'^ < Xj^ 2 {t) < (1 + 4:5) exp ^ J Gj{v) dv'^ . (3.19) 

Moreover, the upper bound holds for all t G n [0,OArr]. 
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3. For all j > k* + 1 and all t G [rj+i, 7 ^+/^] n [0, oatT], we have 

- -^ exp ^ J Gj{v) dv'^ < Xj{t) < - —i^ exp ^ J Gj{v) dv'^ . (3.20) 

4 . For all j > k* + 1 such that 'Jj+k < ci^T, we have 

Xj{t) < — exp ( f 

h \J Tj + l 

for all t G [yj+K,aNT]. 

5. For all j > k* + 1 such that ^j+L < cl^T, we have Xj{t) = 0 for all t > Jj+l- 

Remark 3.4. When the statement of part 1 of Proposition 13.31 holds, the number of early type j 
individuals can not reach s/fj, until after time tjAgnT, and because f,j > tj by definition, no other 
type j individuals appear until after time tj. It follows that if j > /c* + 1, then > tj A aj\fT. 

The next proposition shows how the mean number of mutations in the population evolves 
over time. Note that the mean number of mutations in the population is near zero before time 
utv and is near j during the time interval [ 7 j, 7 j+i). 


dv 


(3.21) 


Proposition 3.5. There exist positive constants C 4 and G 5 such that for sufficiently large N, 
the following statements all hold with probability at least 1 — e: 

1. For all t G (t*,OAr], we have M{t) < 2 >e~^GN-t)_ 

2. For all t G (a7v,7fc*+i)? we have M{t) <kN + C 4 ,. 

3. For all j > k* + 1 and t G [ 7 j, 7 j+i) n [0, aj\fT], we have 

\M{t) - j\ < (3.22) 


4- For all j > k* + 1 and t G [Tj,Tj+i), we have M(t) < j — 1. 

Next, we state a result concerning the differences Tj+i — tj. Here q is the function defined in 

(fTTSll . 

Proposition 3.6. Fort G [0,OArT], let 


R{t) = k*l{t<aN} + #{j >k* + l:t-aN<Tj<t}, (3.23) 

where ffS denotes the cardinality of a set S. For sufficiently large N, the following statements 
all hold with probability at least 1 — s: 


1. We have < 2af^/kf^. 

2. We have 


R{aNt) 
sup — - 

t&[0,T] 


Qit) 


< 5. 
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3. For all j > k* + 1 such that either tj + 2aN/kM < aiqT or Tj+i < omT, we have 


and 


In particular, 


Remark 3.7. Let 


[Tj+F^N 1 + 25 

/ q{t) dt < — - 

Ir+a^ kN 


j-Tj+i/aN 

J Tj j a^v 


1-26 

kN 


Oat 2aN 

< ^i+i - < 


3k N 


kN 


J = S^atT + fe* + 1. 

If the statement of part 3 of Proposition 13.61 holds, then (I3.26P implies that 


ON 


Tj> Tj - Tk*+i > - [k* + 1)) A OnT = OnT. 

3kn 


(3.24) 

(3.25) 

(3.26) 

(3.27) 

(3.28) 


Assuming, in addition, that the last statement of part 1 of Proposition 13.31 holds, it follows that 
no individual of type J + 1 or higher can appear until after time onT. Consequently, throughout 
the paper, it will usually only be necessary to consider individuals of type j for 0 < j < J. 


In section [9l we show how Theorems 11.1111.21 and 11.41 follow from Propositions 13.2113.3113.51 
and 13.61 


3.3 Waiting for the time ( 

Although Proposition 13. II is proved in section [5] independently of the other results in this section, 
it does not seem to be possible to prove Propositions 13.2113.3113.51 and 13.61 sequentially. Proving 
Propositions 13 . 21 and 13 . 31 requires that we have some control over the quantities M{t) and Tj+i — Tj, 
which are established in Propositions 13.51 and 13.61 On the other hand, to prove Propositions 
13.51 and 13.61 it will be necessary to have control over the quantities Xj{t), as established by 
Propositions 13.21 and 13.31 Consequently, we will prove these propositions simultaneously by 
defining a random time C which will be the first time that one of the statements in the above 
propositions fails. We will then show that C > unT with high probability. 

Choose constants Ci and C 2 as in Proposition 13.11 Let 

Co = inf{t < t* : either \Xj(t) — Xj{t)\ > 5xj(t*) for some j < k]^, 
t = t* and (I3.inp fails to hold for some j E {k]if, fc)l)), 

Xk*{t) > s/\x, 

or Xj{t) > 0 for some j > 

Note that Co = 00 if the four statements of Proposition 13.11 all hold. 

Next, for all nonnegative integers j, we will define a random time Ci,j) which is essentially the 
first time that the behavior of the type j individuals violates the conditions of Proposition 13.21 or 
Proposition 13.31 First consider j < k*. For t E [t*,'yk*+K], let Aj{t) be the event that (13.121) fails 
to hold. For t E {'yk*+K,'yk*+L), let Aj{t) be the event that (I3.13|) fails to hold. For t > 7fc*+L, 
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let Aj{t) be the event that Xj{t) > 0. Now consider j > k* + 1. Choose a constant as in 
Proposition 13.31 For t>t*, we say that Aj{t) occurs if t E [rt,rj_|_i] and (13.181) or (13.191) fails to 
hold, if t E and the upper bound in (I3.19|) fails to hold, if t < r* and Xj^i{t) > s/2//, if 

t < Tj+i and an early type j individual acquires a type j +1 mutation at time t, if t E [rj+i, 
and (I3.2n|) fails to hold, if t > 'Jj+k and (I3.21|) fails to hold, or if t > 'Jj+l and Xj(t) > 0. Then 
let 

Cij = inf{t : Ai{t) occurs for some i < j} 

and 

Cl = inf{Ci,i : 0 < j < J}. 

Next, we will define C 2 to be the first time when the result of Proposition 13.51 fails. More 
precisely, choose C 4 and 6*5 as in Proposition 13.51 and let 


C 2 = inf{t : either t E {t*,a]\[] and M{t) > 

t E (oat, 7 fc*+i) and M{t) > kj^ + C 4 , 

for some j > k* + 1 we have t E [ 7 j, 7 j+i) but (I3.22p fails to hold}, 

or for some j > A;* + 1 we have t E [Tj,Tj+i) but M{t) > j — 1}. 


Also, let 

Cs = inf{t ; either t = 2 aN/kN and Tk*+i > 2aN/kN, 

\R{t)/kN - q{t/aN)\ > S, 

there exists j > k* + 1 such that Tj+i < t but (13.241) . (j3.25p . or ()3.26l) fails to hold, 
or there exists j > k* + 1 such that Tj+i > t and t = Tj + 2a]\f/kN}, 

which can be interpreted as the first time when Proposition 13.61 fails. Finally, let 

C = min{Co,Ci,C2,C3}- (3.29) 

Note that C depends on 5 and depends also on e and T through the choice of b in (I3.14h . Also, C 
depends on the constants Ci, ..., Cs. The constants Ci and C2 are chosen independently of the 
others in Proposition 15.91 below. The constant 6*3 is specified below in (j8.54p . The constants C4 
and C5, which depend on Cs, are obtained below in Propositions 16.51 and 16.81 respectively. 

Proposition 3 . 8 . There exist positive constants Ci,... , 0 ^ such that for sufficiently large N, 
the following hold: 

1 . On the event {Co = 00}, either C2 > Ci A Cs or Ci A C2 A Cs > a^T. 

2. On the event {Co = 00}, either Cs > Ci A C2; with strict inequality on the event {C2 < Cij; 
or else Ci A C2 A Cs > a^T. 

3 . We have 

J 

^^({Co = 00} n {Ci,j < C2 A Cs A Ci,i-i A ajsfT}) < -, 

j=0 

using the convention that Ci,-i = 00. 
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We prove parts 1, 2, and 3 of Proposition 13.81 in sections [H [71 and [ 8 ] respectively. Here we 
show how Proposition 13.81 along with Proposition 13.11 implies Propositions 13.21 13.31 13.51 and 
13.61 Essentially, parts 1, 2, and 3 of Proposition 13.81 show that ^ 2 ) Cs; and Cl respectively are 
unlikely to be the first of these three times to occur. This forces C to be pushed beyond time 
a]\fT with high probability. Note that a consequence of this result is that for sufficiently large N, 
the conclusions of Propositions 13.1113.2113.3113.51 and 13.61 simultaneously hold with probability at 
least 1 — s. 


Proof of Propositions fXH \3.S\. [Ql and \S.6[ . By Proposition 13.11 we have P{Co = oo) > 1 — e/2 
for sufficiently large N. By Proposition 13.81 we have 

p(^{Co = oo} n ^ U{Cl,i < C2 A Cs A A OArT}^^ < - 

for sufficiently large N. Combining these results, we get 

P ({Co = 00} n ( Pi iCij > C 2 A Cs A Ci,i-i A aTvT}^ ^ > 1 - e (3.30) 


for sufficiently large N. 

We claim that for sufficiently large N, we must have C > clnT on the event in (I3.30p . By 
part 1 of Proposition 13.81 for sufficiently large N, on {Co = 00 } n {C < clnT}, we must have 
C2 > Cl A Csj which implies either C = Cl or C = Cs- Likewise, part 2 of Proposition 13.81 implies 
that for sufficiently large N, on {Co = 00 } H {C < ajyT}, either C = Ci or C = C2- Since the 
strict inequality required by part 2 of Proposition 13.81 rules out the possibility that C2 = Cs < Ci) 
it follows that for sufficiently large N, on {Co = 00 } n {C < awT}, we must have C = Cii and 
therefore C = Ci,i for some j. However, on the event in (|3.30p . we see by induction on j that we 
can not have C = Ci,i foi' any j < J. 

Hence, for sufficiently large N, we have C > on the event in (13.301) . Thus, by (13.301) . for 
such N we have 

P(C > ajvT) > 1 - e. (3.31) 

Propositions 13.21 13.31 13.51 and 13.61 follow from (j3.3ip . Note that Remark 13.71 implies that on 
{C > clnT}, no individual of type J + 1 or higher appears until after time oatT, which is why it 
is only necessary to consider Ci,j for 0 < j < J. □ 

4 A useful martingale 

In this section, we introduce a martingale which will be useful throughout the paper for controlling 
the fluctuations of the number of type j individuals in the population. 

4.1 Constructing the martingale 

We first record the birth and death rates for different types of individuals. Let Fj{t) be the 
htness of a type j individual at time t, which is max{0,1 + s(j — M{t))}, divided by the sum of 
the fitnesses of the N individuals in the population. Note that, if there is a birth event at time 
t, then Fj{t—) is the probability that a particular type j individual is the one chosen to give 
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birth. As long as every individual’s fitness is strictly positive, the sum of the fitnesses of the N 
individuals in the population is 

OO OO OO OO 

A,(t)(l + s{j - M(t))) = J2Xj{t) + sY,jXj{t) - sM{t)Y,X,{t) 
j=0 j=0 j=0 j=0 

= N + sM(t)N - sM(t)N 
= N, 

in which case Fj(t) = (1 + s(j — M(t)))/N. 

There are three ways that the number of type j individuals could change at time t: 

1. If j > 1, a type j — 1 individual could acquire a jth mutation at time t. This event happens 

at rate ). So that our formulas hold also when j = 0, we adopt the convention 

that A_i(t) = 0 for all t > 0. 

2. The number of type j individuals could increase by one because of a birth. This happens if 
one of the N — Xj{t—) individuals that is not type j dies at time t, and the new individual 
born has type j. Because each individual dies at rate 1, and when a death occurs, the 
probability that a type j individual is born is Xj(t—)Fj{t—), the rate at which new type j 
individuals are born is {N — Xj{t—))Xj{t—)Fj(t—). We define 

B^it) = {N - X^{t))F^{t), (4.1) 

which can be interpreted as the rate at which a particular type j individual gives birth 
following the death of an individual with a different type. 

3. The number of type j individuals could decrease by one because of a mutation or death. 
The rate at which one of the type j individuals acquires a (j + l)st mutation is ^Xj{t—). 
The rate at which the number of type j individuals decreases due to a death is given by 
Xj{t—){1 — Xj{t—)Fj(t—)) because there are Xj{t—) type j individuals each dying at rate 
one, and when a death occurs, the probability that the new individual born is not a type 
j individual is 1 — Xj(t—)Fj{t—). Thus, the total rate of events that reduce the number of 
type j individuals is fiXj{t—) + Xj{t—){1 — Xj{t—)Fj{t—)). We define 

Dj{t) = ^ + 1 — Xj{t)Fj{t), (4-2) 

which can be interpreted as the rate at which a particular type j individual either acquires 
a mutation or dies and gets replaced by an individual with a different type. 

Let Xj{t) be the number of times in [0,t] that the number of type j individuals increases by one. 
Let Xj(t) be the number of times in [0, t] that the number of type j individuals decreases by one. 
Then Xo{t) = N + X^{t) - X^{t) for alH > 0, and Xj{t) = A}(t) - Xf{t) for all j G N and t > 0. 
From the rates obtained above, we see that if we dehne 

Wht) = Aj'(t) - [ {nXj_i{u) + Bj{u)Xj{u)) du (4.3) 

Jo 

and ^ 

Wfit) = Xf{t)- [ Dj{u)Xj{u) du, (4.4) 

40 
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then the processes > 0) and > 0) are martingales for all j G Z"*". Therefore, 

if we define Wj(t) = Wj{t) — Wf{t) for all t > 0, then the process > 0) is a martingale 

for all j G Z"*". Let AWj{t) = Wj{t) — Wj{t—). Because the process Wj is locally of bounded 
variation, the quadratic variation is given by 

[Wjm= ^ AW,{uf = X^{t)+Xf{t) 

ue[o,t] 

(see (8.19) of [l5]). Because Wj + W^, being the sum of two martingales, is a martingale, we get 
(see Definition 8.22 of m) 

{Wj){t) = [ {nXj_i{u) + Bj{u)Xj{u) + Dj{u)Xj{u)) du. (4.5) 

Jo 

We will work primarily with a different martingale. For all t > 0 and j G Z"*", let 

G*{t) = Bj{t) - Dj{t) = NFj{t) 

As long as every individual’s fitness is strictly positive, we have 

G*{t) = N • 1 + _ 1 _ ^ _ M{t)) = G,it), (4.6) 

where Gj{t) was defined in (j.S.lip . We interpret G*{t) as the growth rate of the type j population 
a time t. In Proposition 14.11 below, we define a martingale that will be very useful for studying 
how the number of type j individuals evolves over time. This is similar to the martingale studied 
in section 4 of [8]. 

Proposition 4.1. For all t > 0 and j G Z"*", let 

Jo 

Then {Zj(t),t >0) is a mean zero martingale with 
Xai {Zj{t)) = E 
Proof. For t > 0 and j G Z"*", define 

Ij{t)=e~^o(^*iG)dv_ 

The processes Xj and Ij are both semimartingales, so the Integration by Parts Formula (see 
Corollary 8.7 of [15]) gives 

Ij{t)Xj{t) = Ij{0)Xj{0) + [ Xj{u-)dlj{u)+ [ Ij{u-)dXj{u) + [Xj,Ij]t. (4.9) 

Jo Jo 

Because the processes Xj and Ij are locally of bounded variation, and the process Ij has contin¬ 
uous paths, we have (see (8.19) of [T5] l 

\Xj,Ij]t = t) for all t a.s. (4-10) 


- 2 /;g‘(D (u) + BAu)X, 
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Also, 


f Xj{u—)dlj{u) = — f Xj{u)G*{u)Ij{u) du. 
Jo Jo 


(4.11) 


Because 


Xj{t) = Xj{0) + x]{t) - xf{t) = Xo{t) + Wj{t) + / (^Xj_i(u) + G*{u)Xj{u)) du 

Jo 

and Ij{t) is a continuous function of t, we get 

f Ij{u—)dXj{u)= f Ij{u){fj,Xj-i{u) + G*{u)Xj{u)) du + f Ij{u) dWj{u). 

Jo Jo Jo 

Combining (14.91) . ()4.10l) . (|4.11ll . and ()4.12l) and using that lj{0) = 1, we get 

Ij{t)Xj{t) = Xj{0) + [ Ij{u)fLXj_i{u)du+ [ Ij{u)dWj{u). 

Jo Jo 

Therefore, in view of (|4.7I) and (|4.8I) . we have 

Zj{t)= [ Ij{u)dWj{u). 


(4.12) 


Note that Dj{t) < 1 + /U for all t. Also, because 0 < Fj(t) < 1 for all t, we have Bj{t) < N for 
all t, and so the process {G*{t),t > 0) is bounded. Therefore, using (14.51) . for each fixed t > 0, 
we have 

rt 


E 


[ l]{u)d{Wj){u) =E [ e-^^o^U'’)<^^[^Xj_i{u) + Bj{u)Xj{u)+Dj{u)Xj{u))du 

Jo 1 [Jo 


< oo. 


Therefore (see Theorem 8.32 of m ), the process {Zj{t),t > 0) is a square integrable martingale 
and 

rt rt 

{Zj){t) = / l]{u) d{Wj)iu) = / e-2/o G*(v) + Bj{u)Xj{u) + Dj{u)Xj{u)) du. 

Jo Jo 

Because Zj{0) = 0, the process {Zj{t),t > 0) is a mean zero martingale. Finally, because 
Var(Zj(t)) = E[Z‘^{t)] = E[{Zj){t)] (see Corollary 8.25 of [E]), the result follows. □ 


4.2 Generalizations 


It will often be useful to consider the martingale of Proposition 14.11 started or stopped at a 
stopping time. Let {Et)t>o be the natural filtration of the process {{XQ{t), Xi(t), . ..), t > 0). Let 
r be a stopping time with respect to {Et)t>o- Let XJ(t) = Aj(t Ar) and ZJ(t) = Zj{t Kt) for all 
t > 0. Then the process {{Xq (t), XI (t), X2 {t), ■ ■ ■),t > 0) represents the population modihed so 
that it does not change after time r. Because stopped martingales are martingales, the process 
{ZJ{t),t > 0) is a martingale with {ZJ){t) = {Zj){t A r), and we have the following corollary. 

Corollary 4.2. Let t be a stopping time, and let ZJ{t) = Zj{t A r) for all t > 0 and j E Z"*". 
Then {ZJ(t),t >0) is a mean zero martingale with 


Yav{Zj{t)) = E 



e-2 fo G* (v) dv + Bj {u)Xj (u) + Dj {u)Xj (u)) du 
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Also, the process {{Xo{t), Xi{t), X 2 {t ),... ),t > 0) is a Markov chain on the countable state 
space S = {(xo, xi,...) : Xj E for all j and ^3 ~ therefore satisfies the Strong 

Markov Property. Combining Corollary 14.21 with the Strong Markov Property leads to the fol¬ 
lowing result. 

Corollary 4.3. Let k and r be stopping times with k < t. For all j E Z"*", let Zj’^(t) = 0 if 
t < K, and if t > K, let 


/*i At 

Zf^{t) = e" ^^Xj{t At)- du - Xj{K). 

J K 


Then {Zj’'^{k t),t > 0) 

is a mean zero martingale with 


Var(Z;’A« + i)l-^K) = ^ 

■ p{K+t)/\T 

/ e“2 iiiXj_i{u) + Bjiu)Xj(u) + Dj{u)Xj(u)) du 

. J K 



Also, we will sometimes need to consider the type j individuals that are descended from an 
individual that gets its jth mutation during some time interval. The following result is established 
in the same way as Proposition 14.II and Corollary 14.31 except that the mutation rate is set to zero 
outside of the time interval (k, 7 ]. 


Corollary 4.4. Let k and 7 be stopping times with k < 7 . Let j E Z"*". For t >0, let Xj^’'^\t) be 
the number of type j individuals in the population at time t that are descended from individuals 
that acquired a jth mutation during the time interval (k, 7 ]. Let zY''^\t) = 0 if t < k. If t > k, 
let 


{t) = e" ^3 _ / ^Xj_i{u)e- ^3du. 

J Ks 


Then {Zj^’'^\k + t),t > 0) is a mean zero martingale. Denoting by B^^’'^\t) and D^^’'^\t) the 
expressions on the right-hand sides of (EZP and with x'^p'^\t) in place of Xj{f), we have 


Var(zf’^'(K + t)|T«) 


= E 


/ + B'X\u)xfAn) + du 

J K 


Tn 
(4.13) 


IS a mean zero 


Furthermore, if t is a stopping time with k < t, then {Zj^’"^\{k + t) A r),t > 0) is 
martingale, andXav{Zj^’'^\{K + t) AT)\Ef^) is obtained by replacing n + t with {K-\-t)AT in ^.13^ . 


Remark 4.5. By the Strong Markov Property, the result of Corollary 14.41 holds even if j is 
random, as long as j is TK-measurable. 


4.3 A related supermartingale 

We will also need to consider a supermartingale that involves not just the individuals of type j 
but the individuals of all types less than or equal to j. For j E Z+ and t > 0, let 

S,{t)=Xo{t) + Xi{t) + ---+X,{t). 

There are two ways that the value of the process Sj could change at time t: 
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1. The number of individuals with j or fewer mutations could increase by one because of a 
birth. This happens when one of the N — Sj{t—) individuals with more than j mutations 
dies and is replaced by an individual with j or fewer mutations. Because each individual 
dies at rate 1, and when a death occurs at time t, the probability that a type i individual 
is born is X^{t—)Fi{t—), the rate at which this occurs is 

{N - Sj{t-))Y,X,{t-)F,{t-). (4.14) 

£=0 


2. The number of individuals with j or fewer mutations could decrease by one because of a 
mutation or death. The rate at which one of the type j individuals acquires a (j + l)st 
mutation is There are Sj{t—) individuals with j or fewer mutations that could 

die, and when a death occurs, the probability that the new individual born has more than 
j mutations is 1 — Therefore, the total rate of events that reduce the 

number of type j individuals is 

-Y^Xi{t-)F,{t-)\ +^iXj{t-). (4.15) 

^ e=o ^ 

Let 

j 

Vjit) = Nj2Mt)Fe{t) - S,it) - ^iXj{t), 

£=0 

and note that the difference between the expressions in (I4.14h and (I4.15h is Vj{t—). 
soning as in the argument following (j4..Sp and (j4.4p . the process {Sj{t) — fQVj(u) du, 
martingale. This leads to the following proposition. 

Proposition 4.6. For all j E Z"*" and t >0, let 

and let Yj{t) = e~Sj{t). Then {Yj{t),t > 0) is a supermartingale for all j E Z"*". 

Proof. Lemma 3.2 in Chapter 4 of [9] states that if {X{t),t > 0) is a process which takes its values 
in a complete separable metric space E and is adapted to {Ft)t>o, and if / : Li —)• R and g : E ^ R. 
are bounded measurable functions such that inf^, &Ef{x) > 0 and {f{X{t))-f^g{X{u))du, t > 0) 
is a martingale with respect to (T))t>o, then the process whose value at time t is 

/(X(i))exp(-^ ttm"") 

is a martingale with respect to {Ft)t>o- We can apply this result with X{t) = (Xo(t),Xi(t),...), 
f{X{t)) = Sj{t) + rj where rj > 0, and g{X{t)) = Vj{t)l{Sj{t)>o} to get that if 

y;(t) = {Sj{t) + 7 ?) exp ^ > 


Thus, rea- 
t > 0) is a 
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then > 0) is a martingale. Note that Fi{t) < Fj{t) for all (. < j. Therefore, 


V-/J- _ ---...cv-/ _ -J 


V,it) 




e=o ^ 


Sj{t)+v 


Sj{t) + v ^QSj{t)+r] 

If 0 < u < t, then for all rj > 0, 

^|^g“/o _j_ Jo ('y)+^))l{Sj(D)>o}'^'''| 

(t')\F ]e~i^ji'^)~i'^j('^)/iSji'^)~‘^'’l)))^{Sj(v)>o} 

= {u)e~ (r)/ {Sj iv)+v))Y{Sj {v)>0}dv 


(4.16) 


Wj 

dv 


Letting ?? —)> 0, we get 


E[e <e fo Gj(^n{Sj(v)>o}dv 


(4.17) 


Because Sj(t) = 0 whenever Sj{v) = 0 for some v < t, the indicators on both sides of (|4.17l) can 
be removed. It follows that E[Yj{t)\Fu] < Yj{u). That is, {Yj{t),t > 0) is a supermartingale. □ 


Remark 4.7. As long as Gj{t) = G*{t), since we are assuming N is large enough that s > /r, 
for all i < j we have 


NFi{t) -l = G}{t) + fi = Gi{t) + fi = s{e- M{t)) < s{j - M{t)) -/j = Gj{t), 
and therefore Gj{t) = Gj{t). 


5 Proof of Proposition 13.1 

In this section, we study the behavior of the process before the time t* defined in (13.6p . We 
prove Proposition 13.11 Recall the definitions of k^, k~^, k~^, and k* from (II.lip . (13.2p . (13.3p . and 
p3.4p . Part 1 of Proposition 13.11 savs that for j < k^, the number of type j individuals at time 
t € [0, t*] is well approximated by Xj{t), which is defined in (13.71) . Part 2 handles the delicate 
case in which there is an integer j in the interval {k^,k~l^). Parts 3 and 4 say that for j > k'j^, 
no type j individuals appear before time t*, and there are fewer than s/iJ, individuals of type k* 
through time t*. 

5.1 Bounding the mean number of mutations 

Before time t*, the mean number of mutations in the population is close to zero. Accordingly, 
let rj = ^k%/s, and define the stopping time 

T = inf{t : M{t) > r]}. 

Recall the definition of the martingale {Zj{t),t > 0) from (14.70 . We will consider the processes 
{XJ{t),t > 0) and > 0), where XJ{t) = Xj{t A r) and ZJ{t) = Zj(t A r) for all t > 0. 

From assumption A3 and (II.Sp . we see that for all o > 0, 

(sk^Y —)• 0 as iV ^ oo, (5.1) 

s 
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so in particular r/ —>■ 0 and sr/ —)• 0 as —?> oo. Therefore, we may and will assume throughout 
this section that N is large enough that st] < 1. This implies that the fitness of every individual 
is strictly positive before time r, which means Gj{t) = Gj{t) = s{j — M{t)) — /i for all j € Z"*" 
and t < T. Our first goal is to show that with high probability, we have t > t*, and so stopping 
the process at time r does not change the behavior of the process before time t*. To do this, we 
need the upper bound on E[XJ (t)] provided by the following lemma. This lemma will also be 
useful for first moment estimates later in the proof. 

Lemma 5.1. For all t > 0 and j € Z"*", we have 


E[X]{t)] < 


- 1)^' 

s^jl 


(5.2) 


Proof. Let t > 0. Let m G N, and for i G {0,1,... ,m}, let ti = Let be the 

number of type j individuals at time t that are descended from individuals that acquired their 
jih. mutation during the time interval The process (z|**’**+^^(tj + t),t > 0) introduced 

in Corollary 14.41 is a mean zero martingale. The process stopped at time r is also a mean zero 
martingale, so 


E 




= E 


J 


■ti+lAr 


[iXf._^{u)e 






Now sj — sr] — n < Gj{v) < sj for all v G [0,r), which implies that 


E[xf^’^^+^\t At)] < 


g Gj (v) dv y[ti 


A ' 


(t At) 


At 

-Jti 

< e(^^+G(A+i-ti) f f^e^Et-u)^^xj_^{u)] du. 
Jti 


Summing over i G {0,1,... , m — 1} gives 


E[x: 


(t)] < j\e^^^^-^^E[X]_^{u)] du 


and then letting m ^ oo gives 


E[XJ{t)] < 



^e^Ft-n)E[xj_^{u)] du. 


(5.3) 


We now use (j5.3p to prove (15.2p by induction. Because XQ{t) < N for all t > 0, we have 
£'[Xg(t)] < N for all t > 0, which establishes the result when j = 0. Suppose j > 1 and ()5.2p 
holds with j — 1 in place of j. Then by (15.31) . 


E[XJ{t)] < [ ^e' 
Jo 


„sj{t-u) _ 




i-1 


Hj - 1)! 


(e*“ - 1)^-^ du < 




sjt 


i(j - 1) 


d 


^-sju^^su _ du. 


Because 

^-Sju^^su _ 

the result follows by induction. 



(1 - e-’^y 
sj 


(5.4) 

□ 


26 














Lemma 5.2. We have lim P{t < t*) = 0. 

N^oo 

Proof. If T <t*, then M{t* At) = M{t) > rj, so by Markov’s Inequality, 


By Lemma l5.11 


P{t < t*) < P{M{t* A r) > r/) < 


E[M{t* A r)] 
rj 


^ LXJ 

£;[M(r Ar)] = - 

i=i 



where y = (/i/s)(e^** — 1). Recalling ()3.6p . we have y < (/r/s)e^** < {y/s)kjf. Therefore, in view 
of (|5.ip . we have y ^ 0 as N ^ oo, and thus ye^ < 2y for sufficiently large N. Using (|1.6p . for 
sufficiently large N, 


P(t <t*)< — < 
V 


2{y/s)k% 

{y/s)k% 


as N ^ oo. 


as claimed. 


□ 


5.2 Controlling the fluctuations in X, 

Our goal in this subsection is to obtain sharp bounds on the fluctuations of the number of type j 
individuals before time t*. Because the randomness can be expressed in terms of the martingales 
Zj, the key result is the next lemma, which will provide control on the value of \Zj(t)\. Before 
stating this lemma, we establish a simple bound on the birth and death rates that will be useful 
throughout the paper. Note that for all t such that all individuals at time t have a strictly positive 
fitness, and in particular for all t < t, we have 

Bjit) + h)j[t) = [N — 2Xj(t))Fj(t) + 1 + /i 

_ (X-2X,(t))(l + s(j-M(t))) , ^ , 

N + 

< 2 + sj + /i. 

Because sk^ —)• 0 as X ^ oo by (|3.5I) and assumption A3 and /i —>• 0 as X —>■ oo, we have for 

j < k^, 

Bj{t) + Dj{t) < 3 for sufficiently large X. (5-5) 

For future reference, note that (15.5p also holds for all j < J = Sk^T + A;* + 1. 
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Lemma 5.3. Let e > 0. For sufficiently large N, we have 


p[ sup \ZJ{t)\ < 16W ^ for all j < k£\ > 1 - 

Proof. By Corollary 14.21 the process {ZJ{t),t > 0) is a mean zero martingale. Since G*(t) = Gj{t) 
for t < r, we have 


Var(Znr)) = E 


rt*AT 


LJO 


e-2 So Gj (v) dv + Bj{u)Xj(u) + Dj{u)Xj(u)) du 


Combining this result with (15.5p and Lemma l5.11 we get 


Var(ZJ(C)) < r 
io 


sffil 


V Hi -1)! 


du 


< e 


2(sri+ii)t* _ 


sffil 


10 


^-2sju(^^^su _ ly-igj + 3(e*« _ i)i) du 


< ^ + 3e-*^“) du. 

sffil Jo 


For j < k^, the result ()5.ip implies that 


(5.6) 


2{sri + ffit* < 2{fik% + /^) ( - log /cat ) —>• 0 as N ^ oo 


(5.7) 


and therefore ^ 1 as iV —?> oo. Also, as a consequence of assumption A3, we have 

e-s{j+i)ugj _|_ ^Q-sju < -)- 3 —i, 3 as iV —)• oo for all u E [0,t*]. Thus, for sufficiently large N, 


Var(ZJ(r)) < 


ANffit* 

sffil 


(5.8) 


for all j < /cH. By the Maximum Inequality for martingales. 


P 


sup 

t£[0,t* 


\ZJ{t)\>16. 


‘ N ffit*kN 


< A\ar{Zf{t*)) 


esffil 


< 


256NiiH*k]\r 16kN 


(5.9) 


for all j < kjf if N is sufficiently large. Since few —>■ oo as iV ^ oo by (11.61) and fe)^ — few —^ 0 as 
A" ^ oo by (13.51) . we have (fe^ + l)/few —>■ 1 as A —>• oo. The result thus follows from (j5.9p by 
taking the union over j E {0,1,... , [fe)^J}. □ 

The next lemma shows that when the processes Zj are bounded as indicated in Lemma 15.31 
the processes Xj will stay fairly close to the deterministic functions Xj dehned in (13.71) . Because 
the difference between Xj and Xj depends in part on the difference between Aj_i and Xj-i, 
the proof proceeds by induction. Rather precise bounds are needed to prevent the errors from 
accumulating too rapidly during the induction process, so some technical work is required to 
obtain sufficiently sharp estimates. 
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Lemma 5.4. On the event that t* < t and 


sup \Zj{t)\ < 16W ^ /or all j < 

te[o,t*] V 


(5.10) 


we have, for all t G [0,t*] and i <k 


N’ 


\Xe{t) - Xi{t)\ < Xi{t)( {i + l){sr] + fi)t + 16^ .^, (1 - e ^ 


i=o 


eNfi^jl (£-/)! 


(5.11) 


In particular, for all £ < k^, we have 


sup |X^(t)-x^(t)| < xe{t*)({£+l){sr] + p)t* + 16'^J ^^ .^, (1-6 "**) (5.12) 

te[o,t*] V y^-dy- J 

Proof. Throughout the proof, we will assume that t* < t and that (15.101) holds. This implies 
that G*{u) = Gj{u) for u <t*. For j < k^ and t G [0,t*], define 

Hj{t) = {sri + p)txj{t) + 

We will first show by induction that for £ G {0,1,..., and t G [0,t*], we have 

|X,(t) - w(t)| < Ht(t) + j) ,, , '"‘~y f‘ du. (5.13) 

j=o y^~ 3 ~ -‘-F® Jo 

Consider first f = 0. Suppose t G [0,t*]. From (14.7p . we get Xo(t) = e-^ + Zo(t)). 

Because —srj — p < Gq{v) <0 for all v G [0, t], it follows that 

\Xo{t)-N\< N{1 - efoGoiv) dv^ + \Zo{t)\< N{srj + p)t + 16^^^. 

Therefore, since xo(t) = N for all t G [0,t*], we have 


\Xo{t) — xo(t)| < N{sr] + p)t + 16 


Nt*kN 


= Ho{t), 


so (j5.13l) holds for £ = 0. 

Next, suppose (I5.13|) holds for i — 1, where i >1. Let t G [0,t*]. Equation (14. 7p gives 
Xe{t) = e^o GdO dv^J^ /“ Ge(v) dv ^ 

= r pXt_i{u)e^l GiO) dv ^ g/,* GPv) 

Jo 

= [ p{Xe_iiu)-xe-i{u))eJ^^^^^'y^^ du+ [ pxe-i{u){eJ^^^‘^^'y du 

Jo Jo 

ft J 

+ / pxe-iiu)e^^J-'^Uu + eJoGdv)dv^^^^y ( 5 -^ 4 ^ 

Jo 
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Equation (15.41) gives 




du = Xi{t). 


(5.15) 


Also, \efuGe{v) dv _ ^si{t-u)^ ^ ^si{t-u)^^- Jl{sM{v)+^l) dv ^ Therefore, 

(j5.15j) implies that 

ft j ft 

/ du < (sr? + //)t / du = (sry+ /i)txf (t). (5.16) 

do do 

Furthermore, because e-^o and we are assuming that (|5.10p holds, 

Combining (I5.14E (I5.15E (I5.16p . and (|5.17p leads to 

|X^(t) - xi{t)\ < Hi{t) + [ fL\Xe-i{u) - X£_i(tt)|e*'^(*“'“^ du. 

do 

Using the induction hypothesis to bound the integral, we get 

ft f ^-2 


(5.17) 


m - Xf{t)\ < He{t) + [ fdd,_i(u) + 

■^0 V ^.=0 




e-i-j 




X £ - e-^(“-^))^-l-2 dv^ du. (5.18) 

The first term /g(u) du in the integral on the right-hand side of (|5.18l) matches the 
j = i — 1 term on the right-hand side of p5.13p . For j G {0,1, ...,£— 2}, the term corresponding 
to j in the sum on the right-hand side of p5.18p can be expressed as 


/ 


fte' 


si{t—u) 




l-l-j 


(£-j - 2 )!s^-i -2 
T-i 7* 


'ddj(u)e*(^-^)(“-^)(l - e-*(“-’'))^-^-2 du ) du 




(£-j- 2 )!s^-i -2 


e-"(“-^)(l - e-*(“-^))^-^-2 du^ dv, 


which matches the term corresponding to j on the right-hand side of (I5.13P because the substi¬ 
tution X = u — V combined with p5.4p gives 


f e-*(«-U (1 _ ^-s(u-v)Y-j-2 ^ _ ^-sxY-j-2 ^ 

Jv do 


(1 _ g-s(t-U)7-i-i 

s{e -j-i) 


Thus, by induction, p5.13p holds for all £ G {0,1,... , [k ^\} and t G [0, t*]. 

Next we will obtain (I5.11h from (I5.13p . For j G {0, !,...,£ — 1}, the term corresponding to j 
in the sum in (I5.13P can be written as 




e-j 


{i — j — l)!s^ 


J £ £r)+fd)uxj (u) + (1 - du. (5.19) 
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The first of the two terms in this expression is bounded above by 


(sv + 


iV/ 


j - 1) 


- du. 

! Jo 


By making the substitution x = and y = e®* and then applying the result (3.199) of [12], we 
see that 


[ (e*“ - 1)4"^(*-“)(1 - du = (e"‘ - if 

Jo 

so upper bound on the first term in (15.191) becomes 


si\ 


A[,/(pSt _ lY 

(sr/ + y)t --= {sy + y)t ■ xeit). 


(5.20) 


The second term in (|5.19p equals 

{i — j — i)!s^“'^“^ y £s^j\ 
Making the substitution x = t — u, we get 


• [ e-"(^-^)“(l - e-"(*-“))^-J-i du. 

Jo 


r e-*(^-j)“(l - e-"(*-“))^-i-i du = e"^(e"* - l)^-^'”^ 

Jo Jo 


dx = 


(1 - 


Also, e®^*(l — e~^^y~^ = (e^* — 1)^(1 — 6“®*)“-^ so the second term in (I5.19p equals 

^ '“/ii ■ 

which matches the term corresponding to j in (15.lip . Furthermore, we have 


He{t) = {sy + fj.)txe{t) + l6^^^^^ -e\{l-e ^*) (5.22) 

and the second term matches the j = i term in (I5.11h . Combining the bound in p5.20p with the 
results in ^5.2111 and (I5.22p gives the bound in (15.lip . 

Finally, note that if t E [0, t*] and 0 < j < i < k^, then 

Xi{t){l - = (e^' - l)'(e*** - 1)-^(1 - e-^y-^xeit*) 

= (e"‘ - l)^-^(e"*‘ - l)-^e**%£(C) 

< (e"‘‘ - l)-^e^^*^xeit*) 

= (l_e-‘‘)-ixr(n, 

so (I5.12P follows from (15.lip . □ 
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5.3 Proof of part 1 of Proposition 13.11 


Here we show how the results in the previous section can be used to obtain the desired control on 
the difference between Xj and Xj up to time t* for j < k~^. The result (15.241) below is essentially 
a restatement of part 1 of Proposition 13.11 

Proposition 5.5. Let to = (1/s) log/ cat. For sufficiently large N, we have 

P[\Xj{t) — Xj{t)\ < 6xj{t) for all j < and t G [to,t*]) ^ ^ ~ (5.23) 

and 

p( sup \Xj{t) — Xj{t)\ < 6xj{t*) for all j < kffj > 1 — —. (5.24) 

\te[o,t*] J 12 

Proof. It follows from Lemmas 15.21 and 15.31 that the probability that t* < t and (15.1011 holds is at 
least 1 — e/12 for sufficiently large N. Thus, the proposition will follow from Lemma f5. 41 provided 
that for sufficiently large N, we have 


{£ + l)(sr? + ff)t + 16 ^ 
i=o 


sH*kN 


P. 


(1 - e-"')-l < ,5 


eNffjl 


(5.25) 


for all t G [toT*] S'!!*! (. < kjj. It will suffice to show that the two terms on the left-hand side of 
(I5.25P each tend to zero as —>■ oo uniformly in I <kfj and t G [toT*]- Th® first term tends to 
zero by the reasoning in (15.7p . so it remains to consider the second term. 

For j < i < kff and t* > (1/s) logkN, we have (1 — < (1 — e as N ^ oo. 

Therefore, for sufficiently large N, we have (1 — e~^^)~^ < 3. It now follows from the Binomial 
Theorem that 


e 


E 


sH*kN P 
eNffjl {£-j)l 


(1 - e-"*)"^' < 3 




3 






l 


for sufficiently large X. To show that this expression tends to zero as —?■ oo for all i < /cj^, it 

suffices to show that 


We use o{kN) to denote a term which, when divided by tends to zero as A^ —)• oo and 0(1) 
to denote a term that stays bounded as N ^ oo. Because n\ n Stirling’s 

Formula, we have 

logk^l = + ^ogkff - k- + 0{1) = kff log kf^ - kff + o{kN). (5.27) 

Also, because s//U—>-ooasA^—>-ooby (II.8p . 
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and because assumption Al implies that 


kN 


lim , , , , 

N-yoo log(l/s_, 


= OO, 


(5.28) 


we have 


logt* =loglogfeAr + log(l/s) + 0(l) =o{kN). 


Finally, note that kj^ log kj^ = kN log kN + o{kN)- Therefore, the logarithm on the left-hand side 
of (I5.26P is 


log t* + log kN - log e - log + log W + log 1 -h 


S \ N 


^(^-logN + k^ log kj^-k^ + log ^ o(fcAr) 


= i ( - log iV -h log k^-kj^ + log N - 


log N 


log 


log(s//i) \\og{s/fi) 


logiV 


= ~2^n + o{kN), 

which tends to —oo as N ^ oo. The result follows. 


+ o{kN) 


(5.29) 

□ 


5.4 Proof of part 2 of Proposition 13.Il 

In this subsection, we consider the case in which there is an integer j G (/c)^, k'^). As noted before 
the statement of Proposition 13.11 for sufficiently large N there can be at most one such integer, 
so we will assnme that N is large enough to ensure this. Also, such a j may not exist for every 
N, so in this subsection asymptotic statements as N ^ oo should be understood to mean that 
we consider a subsequence of integers tending to infinity such that there is an integer in 

(fc)^,, A:)J),) for all i. 

Recall that we can write j as in ()3.9|) . with —1 < bj < 2, and dj = max{0, 6j}. Recall also 
that when such a j exists, we have t* = (4/s) log/cjy. In this case, we can not use the same 
argument as in the proof of Part 1 of Proposition 13.11 because the expression in (I5.29|] does not 
tend to —oo as oo if is replaced by A://. Instead, we will break the type j individuals 

into three subpopulations. Define the times 

fn (dj + 1) log Ajat 

( s J s 

Note that 0 < ri < r 2 < t*. For each type j individual in the population, we can consider the 
time when this individual or its ancestor acquired its jth mutation. For t G [0,t*], using the 
notation of Corollary 14.41 we can write 

Xj{t) = + X^p'^*\t). (5.30) 

Here we are dividing the type j population into three groups, depending on whether the jth 
mutation occurred before time ri, between times ri and r 2 , or after time r 2 . We will consider 
these three snbpopnlations separately in the next three lemmas. 
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Lemma 5.6. We have 


lim = 0 ) = 1 . 

N^OO ^ 


Proof. Clearly = 0 for all t G [0,f*] when ri = 0, so we will assume that ri > 0. Each 

type j — 1 individual is acquiring mutations at rate /r. Therefore, by Lemma l5.ll the expected 
number of times, before time ri A r, that a type j — 1 individual acquires a jth mutation is at 
most 


j\E[XU{t)\dt<^’ 


- 1 ) 


pr 

i/o 


- 1)^-^ dt. 




(5.31) 


We have 


rri 

/ ^ 

Jo 


- 1)^'-^ dt < r dt < 

Jo 


sjri e~‘^^ 


SJ S] 

Therefore, for sufficiently large N, the expression in (I5.3ip is bounded above by 


sij\ 


By Markov’s Inequality, this expression also gives an upper bound for the probability that at 
least one type j — 1 individual acquires a jih. mutation by time ri A r. Using (j3.9p . the reasoning 
in (I5.27p . and the fact that 


j log j = j log kN + o{kN) = kN log kN + o{kN), 


(5.32) 


we get 


log 




,i(6j+l) 2i 


'N 




S^j\ 


log N - j log j + {hj + l)j log kN - 2j - log j! 

-bjkN log kN + [bj + l)j log kN - 2j - (j log j - j) + »(%) 
-j + o{kN), 


which tends to — oo as Ai —>■ oo. Thus, the probability that some individual acquires a jth 
mutation by time ri A r tends to zero as oo. Combining this observation with Lemma 15.21 
gives the result. □ 


Lemma 5.7. For sufficiently large N, 


P 


xp^\f) > I < A- 


12 


Proof. First, suppose bj > 0. By applying the argument that leads to (j5.3p followed by the result 
of Lemma 15.11 and then (15.41) , we get 


pT2 

E[x'^p^^^\t* A r)] < / E[X]_^{u)] du 

J n 


sjt* 


^ Nyi^e 
- si-\j-l)\ 


rr2 


^-Sju^^su _ ^y-1 


fri 


s^j\ 


(l_g-^r-2)j _ _ 


(5.33) 
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Since ^(1 — e = j(l — e ^^y ^se < sje and r 2 — ri = 2/s if N is sufficiently large, 
we have 

(1 - e-^^^y - (1 - < (ra - ri)sje-""i < 2e^ (5.34) 

Since j'/A^Ar —)> 1 as iV —)• oo and 




,sjt* 


(5.35) 


it follows from (j5.33p and (j5.34p that for sufficiently large N, we have 
^[^[r-l,r 2 ](^* A r)] < 15iV/i^g -1)^ _ ^-6, ^ 

When bj < 0, we can use instead Lemma Oto get A r)] < ^[XJ(t*)] < Xj{t*). 

Combining these results gives 

^T)] < 


for sufficiently large N. By Markov’s Inequality, 


p[x'^P'''^\t* At) > ^kj/^Xj{t*'^ < 


eE[X^p''^^\t* At)] ^ ^ 
Xj{t*) 


The result now follows from Lemma 15.21 


□ 


Lemma 5.8. There exist positive constants c and c', not depending on e, such that for sufficiently 
large N, 


p{ckff^x,{n < xp’^^yp) < c'k^^^xjin) > i - 


, —dn 


(5.36) 


Proof. For t E [r 2 ,t*], write 

Z^p'^*\t) = e~ X^p 

as in Corollary 14.41 Then 
X 


Jr2 


- / /xX,_i(u)e 


- r G*{v)dv 

Jro .1 ^ 


du 


At)= f /rXj_i(u)e^- g;{v) dv ^ J/P" Gfv) dv^[pr]^^* ^ ^5 37) 

J 7*2 

Note that r 2 > (1/s) log A: 7 v- Assume for now that t > t* and that the event in (I5.23P holds so 
that, in particular, 


(1 — 6)xj-i{t) < Xj-i{t) < (1 + (5)xj_i(t) for all t € [r 2 ,t*]. 


(5.38) 


Using (15.4p . 


rt* Npe^^^* p* 


>ri 




e-sju^^su _ ^y-L 


’r2 


N^e 


sjt* 


sp\ 


\{l-e-^^*y -{I-e-^P^) 


s^j\ 


1 - 




- 1 - 


ft-AT 


(5.39) 
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We need to consider the asymptotic behavior of Hn = (1 — — (1 — as N ^ go. 

First suppose hj < 0. Note that ^(1 — xY = —j{l — xy~^. Therefore, using ~ to denote that 
the ratio of the two sides tends to one as N —oo, we have 

hN < - k]Y) < ~ (5.40) 

and 

hN > j(l - ~ e~^kY'^\ (5.41) 

Combining (j5.35F (j5.38F (|5.39p . (|5.40p . and (I5.4ip . we get that there are positive constants ci 
and C 2 such that for sufficiently large N, 

nt* 

cikjY^^Xj{t*) < / |uXj_i(?x)e*-^^* dn < C2A:^'^^Xj(t*). (5-42) 

J r2 

In view of p3.1h . the constants ci and C 2 can be chosen so that the equation holds for all allowable 
values of 6. Also, using (I5.38P and then reasoning as in pS.lOp . we get 


0 < 


- e^^(**-“)) du < (1 + d)( 


•sr? -I- 


(5.43) 


Since (1 + 6){sr] + ^)t*kj^ —)• 0 as N ^ oo by the reasoning in p5.7l) . it follows from p5.42l) and 
P5.43p that there are positive constants C 3 and C 4 such that for sufficiently large N, 


cskj/^Xj 


(t*) < f nXj_i{u)e-^^ ''^G;{v)dv < c^k/^Xjit*). (5.44) 

Jr2 


We still need to control the second term on the right-hand side of p5.37p . which requires 
bounding \t* A r). By Corollary 14.41 


Var(zf^’‘‘'(C At)|J;2) 


j 

= E 


■ rt*/\T 

- J r2 


g- 2 /“ Giiv) + B]."^’**](n)Aj"^’**]( 7 r) + dp’^*\u)X^p’^*\u)) du 


We now take expectations of both sides of this equation. Using that ^(u) < Xj(u) for 

u < T, that ^(rt) + Dp\u) < 3 by the reasoning that leads to p5.5p . and that Lemma [5T] 




r2 


holds, we get for sufficiently large 

r 

.E[Var(z]’''’**'(t* A t)| < E 

<-l 

J V 


e ^^“2 ‘^''(/rAJ_i(n) + 3X]{u)) du 


.2isj-s,-,)iu-r2) ^ - 1 )^ 

sfj! 

r 


I si-\j - 1)! + 

< ^2{sn+t.)t* ^2sjr2 . PjP f ^-2sju(^^^su _ ly-^gj + 

J • Jr2 

< ^ 2 {sv+uX e 2 sjr 2 . py. r (e-"(j+i)“sj + 3 e-"^“) du. 

J ■ Jr 2 
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Reasoning as in the derivation of (I5.8p from ()5.6p . we have e < e ^“sj+3) 

for u > r 2 , so for sufficiently large N, 


i?[Var(4^’**](r At)|J-,J] < 


(5.45) 


Note that if y is a random variable and ^ is a cr-field such that ill[y|^] = 0, then by the 
conditional Chebyshev’s Inequality, 


P{\Y\ >a) = E[P{\Y\ >a\g)] <E 
Therefore, (|5.45p implies 


Var(y|g) 


.B[Var(y)|g] 


Pi Ar)| > 


U8NfiH*e^p2\ ^ e 

es^fj! y - 12 ' 

In view of (I5.35p . when the event in (15.461) holds, for sufficiently large N we have 


(5.46) 


Gj{^)dv.^[r2,t 


■i(r Ar)i < 


Combining this result with (15.371) and (I5.44|) , we get that when equation (15.381) and the event in 
p5.46p hold and when t > t*, we have 

(c3 - yN)k^^'Xj{t*) < X^p'^ \t*) < (c4 + yN)kp^'xj{t*) (5.47) 

for sufficiently large N, where 

^ \/ AT 1 

In view of Lemma 15.21 Proposition 15.51 and equation p5.46p , the result will follow if we can show 
that un ^ 0 as N ^ oo. To show this, we make a calculation similar to the calculation in the 
proof of Part I of Proposition 13.11 Noting that and using (|5.27p . p5.28p . 

and P5.32I) . we get 


U9sij\t*e-^P^ dj 
Kn ■ 


log 


I49s^j\t*e-^P2 jdj 




= ^ ^ log 49 + j log + logjl + log t* - (dj + I)j log kN - log e - log + dj log /cat 

= ^ log + j logj -j - (dj + l)j log kN - log + o{kN) 

= ^ ^ log iV + bjkN log kN + kN log kN - j - [dj + l)A:Ar log kN - log + o{kN) 
j ^ [bj - dj)kN log kN 


“"2 + 


-o{kN), 


which tends to —oo as N ^ oo. Thus, t/at —>■ 0 as iV —>■ oo, which completes the proof. 


□ 
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Combining Lemmas 15.6115.71 and 15.81 and using (|5.30p , we arrive immediately at the following 
result, which is essentially part 2 of Proposition 13.11 

Proposition 5.9. There exist positive constants Ci and C 2 such that for sufficiently large N, 
we have, for all j G {kfj, 

P{Cikff^Xj{t*) < Xj{t*) < C2k-^^x^{t*)) > 1 - |- 


5.5 Proof of parts 3 and 4 of Proposition 13.11 

In this subsection, we complete the proof of Proposition 13.11 We will need the following lemma. 
Recall from (13.4|) that k* = max{j G N : j < k^}. 

Lemma 5.10. We have 

sk*t* 

lim - ,« . — = 0. 

N^oo +^k*\ 

Proof. We have, using the reasoning in (j5.27p . 


log 



k*+l^sk*t* 
k* + l I 


logiV — {k* + 1) log 
-{k* + 1 - /cat) log I 



+ sk*t* 
+ sk*t* ■ 


-\ogk*\ 

k* log k* + k* + o{ki\f). 


(5.48) 


We consider two cases. First, suppose k* + 1 — k^ > 1/2. It follows from assumption A2 
that (/cat log/ cat)/ log(s///) 0 as N ^ 00 . Therefore, the first term dominates the expression in 

(j5.48p . so the expression tends to —00 as N ^ 00 . On the other hand, suppose k* + l — kN < 1/2. 
Then k* < kjy — 1/2, which for sufficiently large N implies that k* < kfj- by (j3.5p . It follows 
that there are no integers in the interval which means t* = (2/s) log ^at. Because 

k* + I > k/f, we have k* + 1 — k]\[ > k/^ — kjsf, so in this case, starting from (j5.48p . 


log - ^n) log + 2k* log kN - k* log k* + k* + o{kN) 

= —2k]\f log kf^ + 2k* log k]\f — k* log k* + k* + o{kN), 


which tends to —00 as —)• 00 because k^ ^ k* as N ^ 00 . The result follows. □ 

The results below establish parts 3 and 4 of Proposition 13.11 Proposition 13.II follows immedi¬ 
ately from Propositions 15.5115.9115.111 and 15.121 

Proposition 5.11. For sufficiently large N, 

P{Xk*it) < s/n for all t G [0,t*]) > 1 “ 

Proof. For t < t, we have Bk*{t) — Dk*{t) = G/.,{t) = Gk*{f) > s{k* — ry) — /r > 0 for sufficiently 
large N. Since the rate of events that increase the number of type k* individuals by one is thus 
always greater than the rate of events that decrease the number of type k* individuals by one. 
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the process (t),t > 0) is a submartingale. By Doob’s Maximal Inequality and Lemma ISTTl 
for sufficiently large N, 


P 


sup 

te[o,t*] /^/ 


E[Xl,{t*)] ^ Ar/*+i(e"** -1)^* 
- s’^*+^k*l 


This expression tends to zero as —?> oo by Lemma 15.101 which, in view of Lemma 15.21 implies 
the result. □ 


Proposition 5.12. For sufficiently large N, 


P[Xj{t*) = 0 for all j > and t G [0,t*]) > 1 — 

Proof. Each individual of type k* acquires mutations at rate /i. Therefore, by Lemma l5.ll the 
expected number of times, before time t* A r, that a type k* individual acquires a {k* + l)st 
mutation is at most 


Nfi 


^ f,E[Xl,{t)]dt< 


fc*+l r-f 

Jo 


st 


- 1 )^* dt < 


NyL‘ 


k*+l^sk*t* 


sk*+lk*\k* 


This expression tends to zero as —>■ oo by Lemma 15.101 and the fact that k* 

The result now follows from Markov’s Inequality and Lemma 15.21 


oo as A^ —)• oo. 

□ 


6 Proof of part 1 of Proposition 13.8 


Recall that Proposition 13.51 states that M{t) is close to zero for t < and close to j during the 
time interval [ 7 j, 7 j_|_i). The time C 2 can be interpreted as the first time at which the approxi¬ 
mation to M{f) given in Proposition 13.51 fails to hold. Part 1 of Proposition 13.81 stipulates that, 
during the time interval [t*, aj\fT], the time C 2 can not happen until either (^1 or ^3 has occurred. 
That is, as long as the behavior of the type j individuals follows the description in Propositions 
13.2113.31 and l8. 61 the mean number of mutations in the population must satisfy the approximation 
in Proposition 13.51 

Note that part 1 of Proposition 13.81 is a deterministic statement. To prove it, we will assume 
that Co = 00 , meaning that until time t* the population behaves according to Proposition 13.11 
We will show that if t G {t*,aNT] and Ci A Cs > L then the approximation in Proposition 13.51 is 
valid up through time t. We begin with two lemmas. The first one gives a useful bound that 
follows from (13.18P and (j3.191) . and the second one shows that if t < Tj+i, then type j individuals 
contribute little to the mean number of mutations at time t. 


Lemma 6.1. Let Co = Co + 1 + 4(5, where Co comes from fg. jg|) . Suppose j > k* + 
Tj+i < Ci,j and Tj+i < onT. Suppose also that either Tj+i < Cs or j < J. Then for 


large N, 


s 

Coh 


< exp 




1. Suppose 
sufficiently 

( 6 . 1 ) 


Also, suppose k* + 1 < j < J, Tj < t < Ci,j, and t < a^T. Then for sufficiently large N, if 


we have Tj+i < t. 



( 6 . 2 ) 
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Proof. Suppose j > k* + 1, Tj+i < Cij, and Tj+i < ajyT. If Tj+i < ( 3 , then (13.261) gives 
Tj+i > Tj + a7v/3fc7v > Tj. Now suppose instead j < J. Then for t < t* f\ Cij, part 1 of 
Proposition 13.31 gives Xj^i{t) < s/2/i. Since Xj^ 2 (t) = 0 for t < part 2 of Proposition 13.31 gives 
for t < T* A Ci,j, 

P n ( \ r! / \ 

< (1 + 4,5)e^"^' < (1 + = (1 + 4S) ( ^ j 

which by (j3.27p is less than s/2/x for sufficiently large N. Thus, Xj{t) < s/fi if N is sufficiently 
large and t < Tj A Cij, which means r^+i > t* in this case also. 

Recall from (ll.21h that Xj{Tj+i) = \s/fi]. When > r*, equations (j3.18j) and (j3.19j) with 
t = Tj+i give 

(1 - 4 d)e^^r ^^4-) dv < < (1 + 4(5 + 6 - 3 ) 6 ^^'^' 

The result m now follows immediately from rearranging this equation and observing that 
|'s//i]/((s//x)(l — 4(5)) < 2 for sufficiently large N. 

To prove the last statement of the lemma, suppose k* + l<j<J,Tj<t< Cij, t < ajyT, 
and (16.2|) holds. For t < t* , if N is sufficiently large, then 

J Gj{v) dv < sJ{t* - Tj) < log , 

contradicting (j6.2p . Therefore, we must have t > t*. If Tj <t < Tj+i, then equation (j3.19p gives 
Xj(t) > 2(1 — 4(5)(s//r) > s/fi, contradicting the definition of Tj+i. Thus, if N is sufficiently 
large, then < t, as claimed. □ 

Lemma 6.2. If t £ {t* ^a^T] and Ci A Cs > t, then 

1 T 

j=k*+l ^ 

Proof. Suppose j > k* + 1. If the statement of part 1 of Proposition 13.31 holds, then no early 
type j individual can get a type j + 1 mutation until after time Tj+i A onT. No other type j 
individual appears until after time f,j > tj. Thus, we have Xj^i = 0 for t < r^+i A Tj A a^T. 
Also, Tj+i > Tj A onT, as noted in Remark 13.41 Thus, since we are assuming that Ci A Cs > T we 
have Xj^i{t) = 0 on the event that t < Tj. Therefore, when t G (t*,aArT] and Cl A Cs > t, there 
can be at most one value of j for which r^+i > t but Xj{t) > 0. 

Because Cs > L the calculation in (|3.28p implies that tj > t and thus Xj{t) = 0 for all j > J. 
Because Xj{t) < sj when t < r^+i, the result follows. □ 

The approximation in Proposition 13.51 has four parts. The first part pertains to the case 
t < otv, the second part pertains to the case t G [otv, 7A:*+i)) and the third part pertains to the 
case in which t G [ 7 j, 7 j+i) for some j > A:* + 1. The fourth part will be a consequence of the 
first three. Proposition 16.31 below handles the case of t < oat. 

Proposition 6.3. For sufficiently large N, on the event that (q = 00 and Ci A Ca > t, we have 
for all t G (t*. Oat], 

M{t) < 
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Proof. Fix t G (t*,a]\f]. We will assume throughout the proof that (q = oo and Ci Cs > F 
Suppose first that 0 < j < Write a = (1 + (^)^/(l — S)'^. By equations (13.121) and (13.Sp and 
the fact that Gj{t) — GQ{t) = sj for all f > 0, we have 


N ~ Xo{t) ~ ^^(j.*^jfGQ{v)dv sijl ~ sij\ 

Therefore, 


1 

'n 


[feivj 


E 

i=i 


L^jv J . 

s»E^ 

i=i 


/ie 


st \ 3 


^ A s ) - s 


(6.3) 


Now 

tf^st _ ^gsajvg-s(ajv-i) _ g-s(ajv-i) 
S S 

Because Q-^GN-t) < \ gl/i/sle®* ^ g^ -i- fQiiQ.^g 


1 

'n 




1=1 




(6.4) 


(6.5) 


Next, suppose j G {kj^,k^). Then, using (13.101) instead of (13.81) . the same reasoning used in 
(16.3p gives that for some positive constant Gj, 


Xjjt) ^ 

N ~ s^j\ 

For sufficiently large N, there will be at most one integer in the interval {kf^,k^). In this case, 
using (16.41) and then using that (/i/s)e^* < 1 for the last inequality, we get 


V E :iMt) < 

iG(A;jy,fc+)nZ 





( 6 . 6 ) 


Consider now the case in which j > k* + 1 and rj+i < t. Then by (I3.20p . 

Xj{t) < (1 + S)(s/f3)eAi (6.7) 

The assnmption that t < Ci entails that t* < Tk*+i < Tj+i in view of part 3 of Proposition 13.11 
and Remark 13.41 so using (I3.12p . we get 

s/fi < Xfc*(rfc*+i) < (1 + ,5)Xfc*(C)e/**'‘^' 


Therefore, another application of (I3.12p leads to 


Xk*{t) > (1 - 6 )Xk*{t*)e^t*Gk*{v)dv y 


{l-S)s P^^G,,{v)dv 
(1 + 5)fi 


Thus, since Gj{v) — Gk*{v) = s(j — k*) for all n > 0, combining (16.71) and (16.81) leads to 

(,s(3-k*)(t-Ti+i) - Plfl, Gk*{v) dv 

Xk^t)- 1-5 


( 6 . 8 ) 
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Now 


(■"O'+i 


j-k* 

Gk* (v) dv = 

''^k*+l jn=l 

, j-k 


J-k* . 

S(/ 


''"fc*+m+l 




> 


E 

m=l + 


''"fc*+m+l 


dv - s{j - k*){Tj+i - Tfc.+i). 


By Lemma l6.11 for sufficiently large N we have 


.J-k* 

(E / 

^ m=l 


’■fe*+m+l 


Gfc.+m(u) dv > 


T,= l + 

Combining these observations gives that for sufficiently large N, 

j-k 


CqH 


j-k* 


Xk*it) 


< a{l - 




„s(j-A:*)(rj+i-Tfe«+i) 


= q;(1 — (5) 




j-k* 


^s(j-k*)(t-T^*+i) 


Since Tk*+i > t* and e ^ \ it follows that for sufficiently large N, 


x,{t) 

Xk*{t) 


< a 




j-k* 


■■■'TV 


st\ j-k* 


V sP 


Thus, making the substitution i = j — k*, for sufficiently large N, 

-j CXD CX3 . OO 

j=k*+l j=k*+l ^ ^ ’ i=\ 


C%[ie 

sk% 


st\e 


(6.9) 


In view of (|6.4I) . we see that /sk'jf —)• 0 as N" —>■ 00 , and therefore the infinite sum on 

the right-hand side of (j6.9jl is dominated by the leading term when N is large. Therefore, for 
sufficiently large N, using (16.4p again. 


N 


E j^ji^)M 


Tj+l<t} — u2 

I^N 


j=k*+l 


2aCek* _ ^ ^ 2aC&k* -s(aj,-t) 


k% 


( 6 . 10 ) 


It remains only to consider the case in which j > k* + 1 and Tj+i > t, for which the necessary 
bound is given in Lemma 16.21 Combining (j6.5p . ()6.6[1 . (jO.lOp . and Lemma 16.21 we get that for 
sufficiently large N, 


M{t) < 


Cj 2aCek* 


^-s{aN-t) 


As N ^ OO, clearly Cy/I'fejy]! 0 and 2aC%k*/k‘j^ 0. As for the fourth term, we have 

^s{aN-t) < gsoiv — which means 


jg^s(aN-t) 


Js 

WJj? 


( 6 . 11 ) 


as OO by (11.81) and (II.9|) . The result M{t) < 3e follows because ae < 3 by (13.11) . □ 
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We next consider the case in which t E {aN,'yk*+i)- During this period of time, the mean 
number of mutations in the population increases rapidly from near zero at time to near k* at 
time 7 a:*+i- The upper bound on the mean number of mutations given by Proposition 16.51 below 
will be sufficient for our purposes. Before stating this proposition, we prove a lemma which will 
also be useful in studying the population at later times. 

Lemma 6.4. Suppose j and I are positive integers with j > k*. Let aj = (1 + (5)^/(l — 5) if 
j = k* and aj = (1 + (5)/(l + (5) if j > A:* + 1. Suppose t € n [0,aArT]. Suppose 

also that (q = oo and Ci A Cs > t. Then for sufficiently large N, 

Proof. Assume for now that j > k* + 1. Then because > t, the bounds in ()3.20p . combined 
with the facts that 'yj+x < Ij+e+K and Tj+i < by Remark 13.41 give 


X,{t) 


< 


Ofje'^A+i+i 


Gj-f.i{v) dv 


/_ Gj (v) dv 
e ^3+1 


- r. dvj!.^^(Gj+,(v)-Gj(v}) dv 


= a,e 


= n ■p~ ^ 7 + 1 '*'^ Gj+i(v) dv se(t-Tj+i} 


e 


a,e 


( 6 . 12 ) 


If instead j = k*, then we use (I3.12|] . as in (j6.8|] . rather than (I3.2n|] to get the lower bound on 
Xj(t), and we again obtain (|6.12p . In both cases. 


fTj+i+l ^ PTj+rn + 1 

/ Gj+i{v) dv = ^ I Gj+i{v) dv 

■^7+1 m=l' 


'1=1 ■^7+' 
I 


E 

m=l ''•^7+ 


Tj' + m+l 


Gdv -|- s(£ ?7l)(Tj_|-m+l T~j+m) 


We now apply Lemma IHTT] and (j3.26p to get that for sufficiently large N, 


f'^j+e+i \ / s 

exp ( / Gj+i{v) dv ) > 

' 7+1 


Geti 

s 


exp ( s{£ — m) 

m=l 

' saisfi(i — 1 ) 

exp ■ 


a-N 

3k X 


C^pL J \ Qkx 
Because = s/yt, combining this inequality with (|6.12l) gives the result. 


□ 


Proposition 6.5. There is a positive constant C 4 such that if N is sufficiently large, then for 
all t E (aAr,7fc*+i), on the event that Co = cc and A (3 > t we have 

XI(t^ < k]\f -|- C*4. 

Proof. Suppose t E {ax,'yk*+i)- Suppose also that Co = cc and Ci A Cs > t. Note that 

-OO .jOO -CXD -OO 

= + = + (6.13) 

^ j=o ^ j=o ^ e=i ^ i=i 
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By Lemma 16.41 for sufficiently large N, 


^ OO 




+£+!<*} — 2-^ ^{'^k*+t+l<i} 


f=l 


<(1^^ 

l=l 


1-6 




„ , t(t-l)/6fcjv 
r \ sl{t-Tk*+i) 


Because t — Tk*+i < 7a:*+i — "^^*+1 = o-n and = {s/nY, we have for sufficiently large N 

t(£-l)/6fciv 

... <LL^L^V/r7.^f^' 

N 


Tk*+t+l<i-} — 


1-6 


'6 


(6.14) 


1=1 £=1 
If ri denotes the ^th term in the sum on the right-hand side of (I6.14p , then ri = Cq and for ^ > 1, 


Tj+i _ Cejl + 1) (A 


ri i 

which tends to zero as N ^ oo because 

l/3fcjv 


\S J 


\ l/3fcAr 

<2C,|^) . 


(6.15) 


log 




3kN 


log (- 




[log(s//i)]- 

Slog 


which tends to —oo as —)> oo by (1121). Therefore, the first term dominates the sum on the 

right-hand side of (|6.14p for sufficiently large N, so for sufficiently large N we have 


1 °° 




l=l 


{1 + 5Y 

1-6 


2 Cg. 


(6.16) 


Finally, Lemma [621 and equations p.Sp and (|1.9p give 


1 Js 

^ -t 0 as A^ OO. 


(6.17) 


j=k*+i 


Because k* — ^ 0 as N ^ oo hy (|S.5I) . the result follows from (|6.1SI) . (I6.16p . and 

(16171) . □ 

It remains to consider the case in which t G [ 7 ^, 7 j_|_i) for some j > k* + 1. In this case, we will 
need to consider carefully the contributions to M{t) not just from individuals with an unusually 
large number of mutations, as in the proofs of Propositions 16.31 and 16.51 but also from individuals 
with an unusually small number of mutations. Therefore, we will use the following two lemmas, 
which parallel Lemma 16.41 

Lemma 6.6. Suppose j and i are positive integers such that j — i > k* + 1. Suppose that 
t G [ 7 j, 7 j+i^] n [OjOAfT]. Let ai{t) = (1 6)/{I -6) if t < yj-i+K, and let a£{t) = k%/{l - <5) if 

t > Suppose also that (q = 00 and Ci A (^3 > t. Then for sufficiently large N, 


Xj-e{t) 

Xjit) 


< ae{t) 




l{l-l)/6kN 
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Proof. Because Ci'^Cs > ^ and t E [ 7 ^, 7 ^+/^], we can use (|3.20l) to obtain a lower bound on Xj{t). 
Also, we can obtain an upper bound on Xj^i from (I3.20p when t < and from (13.211) 

when t > 7j_£+x. This leads to 


/ \ / \ r* Ga-Av) dv 

Xj{t) - J^,^^Gjiv)dv 


(6.18) 


Therefore, 




XAt) 


^-Irf^"GAv)dvJ^.GjA)dv 


Gjiv) dv 


Because Ci A Cs > t, it follows from Lemma l6.II that for sufficiently large N, 


GAv) dv 

A ' 


Also, using (I3.26p . 


r 

Tj-i+l 


Gj_i{v) dv ='^ ( f Gj-m{v) dv - s{£ - m){ 

m=l ^ '^^ 3 -m 

m(^) dv J 


- s{l - m){Tj-ra+l - Tj-ra) 


^ ^ /-A—+1 ^ ^ ^ sl{t - l)a7v 

dvj 

^m=l A-"i / 

SO using Lemma 16.11 again, for sufficiently large N, 

The result now follows from (|6.19p . (|6.20p . and (|6.2ip . 


Jrf_e +1 Gj-A'») dv ^ 


(6.19) 


( 6 . 20 ) 


( 6 . 21 ) 

□ 


Lemma 6.7. Suppose i and j are positive integers such that 0 < i < k* and j > k* + 1. Let 
K{t) = 1 + J if t < and let K{t) = if t > lk*+K- There is a positive constant G% such 

that if N is sufficiently large, then for all t E [ 7 j, 7 j+i^] n [0,a7vT], on the event that (q = 00 and 
Cl A (3 > t we have 

X^ 

^t) 

Proof. Because Ci > t, equations (I3.12p and (13.1311 give 


< GsK{t)2^-'‘ k 


-k* ( A 


N 


0-fc*)(j-fc*-l)/6fcjv 




Xi{t) < K{t)Xi{t*)e^t* GAv) dv ^ )e 


-s{k*-i){t-t*) 


Xk*{t* 


■ Xk*{t*)e^t*GkAv)dv^ ( 6 . 22 ) 


Because we are working on the event that Co = 00 , we can use the bounds on Xi(t*) and Xk*{t*) 
from (13.8p and p3.10p . Recall that for sufficiently large N, there is at most one integer j such 
that kfi < j < k'jif, which then must be k*. Let 


Ai 


1 

(1 + 5)47^1 
(l + 5)/(l-h) 


if i = A:* 

if i < A;* and kf^ < k* < k^ 
if i < k* and k* < kA 


45 






























Because dj > 0, it follows from Proposition 13.11 that for sufficiently large N, 


Now k*l/il < and 






-(e- -ir 


e^** - 1 

oSt* 


i-k* 


< < 


-k* 


1 as N ^ oo, 


so for sufficiently large N, 

Xk*{t*) 


<2Xi{- 


k*-i 


(k 


^\k* —i st* (i—k*) —s{k* —i){t—t*) 


= 2 Xi I - 


k*-i 


{k* 


'^^k* —i —s{k* —i)t 


Also, equation (I3.12p implies that for sufficiently large N, 


1 + - > Afc*(Tfc*+i) > (1 - 


and therefore, 


^ ^ - 1-6 


Combining (j6.22P (j6.23l) . and (|6.24p . we get that for sufficiently large 

X,it) < -V 

1-6 

By p3.20p . for sufficiently large N, 


, . (1 — 6 )s f* Gj(v)dv 

Xj(t) > - -" . 




Combining this result with (I6.25P gives that for sufficiently large N 

i 

{k 


\ ^ I / \ k*—i fi GutMdv 

< 2XiK{t)^xLJ^( (y*^k^-i,-s{k^-i)t r 


X,{t) 


{l- 6 Y\^i 


fl Gj (n) dv 


(6.23) 


(6.24) 


(6.25) 


(6.26) 


Note that the ratio of exponentials on the right-hand side of (|6.26l) is the same as the ratio of 
exponentials on the right-hand side of (16.181) with j — kX in place of 1. Consequently, the argument 
used to prove Lemma 16.61 gives 


( 2 sV~^* / N 0-fe*)(i-fc*-i)/6fc^ 


+ 1^/ij 


^-s{j-k*){t-Tj) 


Putting this result together with p6.26p gives that for sufficiently large N, 


W / n X ,.U^ni-k* l + ^i/s fs'^^ * 


{j-k*){j-k*-l)/6kN 


{k 


*'\k*-i -s(k*-i)t -s{j-k*){t-Tj) 


(6.27) 
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Because Co = oo, we have tj > t* = { 6 /s)logk}\f, where 0 = 4 if < /c* < and 6 = 2 
otherwise. Therefore, recalling (|1.22p . 




< e 


-s{j-i)(t-Tj) u-S{k*-i) 






(6.28) 


Also, because k* — k]\f < ^ 0 as N ^ oo hy p.Sp . we have 

j * \ k*—i / 7 + \ k*—i 
\ , f AT 

, 


< 


N 

kw 


1 as iV —>■ oo. 


Recall that Aj = 1 when i = k*. Also, for i < k*, we have A* = (1 + 5)/(l — <5) when 9 = 2 and 
Aj = (1 + 5)k‘^/Ci when 0 = 4. It follows that for sufficiently large N, 


*^k*-i,-e{k*-i) , (e-i){k*-i) 2{1 + 6 ) ,-(k*-i) 

) — ■ ri j: I 

min|l —d, Cl} 


Combining ^6.271) . ^6.281) . and p6.29p gives the result. 


(6.29) 


□ 


Proposition 6.8. There exists a positive constant such that for sufficiently large N, if t G 
[ 7 j, 7 j+i) n [0, a^T] for some j > k* + 1 , then on the event that Co = oo and Ci A Cs > t, we have 

\M{t) - j\ < 

Proof. Throughout the proof, we work on the event that Co = oo and Ci A Cs > t- We also assume 
that t G [ 7 j, 7 j+i). Note that 


1 1 
\M{t) -j\< iX^^,(t) + ^Y 


N 

1 


e=i 

oo 


£=1 

j-k*-l 


YixMt) + ^ Y £x,.i{t) + ^Y^j-i)x,{t). 


l=l 1=1 i =0 


(6.30) 


The argument for bounding the first term is similar to that in the proof of Proposition 16.51 By 
Lemma 16.41 for sufficiently large N, 


^ OO OO 


£=i 


iXj+ejt) 
^ Xj{t) 




1 - 6 ^ 


£=l 


Now t — Tj+i = t — 7 j+i + 7 j+i — Tjj^i = t — 7 j+i + On- Since = (s/p,)^, it follows that 

gs£{t-Tj+i) _ (^^1 ^Y^-s£{')j+x-t) therefore 


1 _L A °° /, N A^-i)/6fciv 

i i- 0 /.w / r 


£=1 £=1 ^ 


-s£(7j+i-t) 


(6.31) 
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Let ri be the t'th term in the sum on the right-hand side of (16.311) . Then ri = Cqb and 

for £ > 1 , 

which goes to zero as —>■ oo by the argument following (16.150 . Therefore, the first term 
dominates the sum on the right-hand side of (j6.3ip . so for sufficiently large N we have 


^ OO 


(6.32) 


i=i 


Also, by Lemma 16.21 


1=1 j=k*+l ^ ^ 


Because t > Cs, equation (I3.26p gives 'Jj+i — 7 j = Tj+i — tj < 2a]\f/kN- Therefore, 

jseA7i+i-7,) 


(6.33) 


< 


- 


0 


Nfi ~ 

as A" —)• OO by (I6.11|) . Combining this result with (16.320 . we get that for sufficiently large N, 


1 

N 


f2^Xj+eit) < {2Ce + l)f 


-«(73+l-i) 


(6.34) 


£=1 


Consider now the second term in (j6.3O0 . Suppose i < j — k* — 1, so that j — £ > k* + 1. As 
in Lemma IhThl write ai{t) = (1 -|- (5)/(l — 5) if t < and ai{t) = k'j^/{l — 6 ) t > 7 j_£+i^. 

Then Lemma 16.61 implies that for sufficiently large N, 


Xj-e{t) 

Xj{t) 


< ae{t) 




e(e-i)/6kN 


,-s£(t-Tj) 


Because 7 ^ — Tj = and = s//r, we have 


-sl{t-Tj) _ ^-se{t-'yj) p-siaff _ [ \ p-seit-'Yj) 


(6.35) 


Therefore, for sufficiently large N, 

Xj{t) 


< ai{t )2 


£( p-stit-'Tj) 


and so 


j-k*-i 


4 E <E «»(*)«'(f 


e{e-i)/ 6 kN 




(6.36) 


l=l £=1 

Let vi denote the £th term on the right-hand side of (16.360 . Note that t < ^j+i < ''yj-i+K as long 
as N is large enough that K >2. Therefore, vi = 2((1 -|- 5)/(l — and for £ >1, 


ve+i ^ 2 k%{£ + 1) ^ 
ve ~ £ Vs 


C3fciv 


-Ai-7.) < 4 :k%(^(^'^ 


l/Skff 
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To see that this expression tends to zero as N ^ oo, note that 


log 



2 log kN 



2 log kN 


[log(s//i)]^ 

Slog 


(6.37) 


which tends to —oo as —>■ oo by assumption A2. Therefore, the first term dominates the sum 

on the right-hand side of (I6.36P when N is large. For sufficiently large N, we therefore have 


j-k*+l 

- (6.38) 

£=1 

Finally, we consider the third term in (j6.30p . Suppose 0 < i < k*. Define K{t) as in the 
statement of Lemma 16.71 By Lemma 16.71 for sufficiently large N, 


k* k* 

- i)Xi{t) < X] 

i=Q z=0 


{j - i)Xi{t) 

Xjit) 


< C8K(t)2^-^* 


ij-k*)ij-k*-l)/6kj^ fc* 
i=0 


Because j — k* > 1, we have for i G {0,1,..., A;*}. Also, if we let 

Vi = {j — i)kj'^ then Vi-ijvi < 2/kN —>■ 0 as A" —>■ oo for i G {1,2,... ,k*}. Therefore, for 
sufficiently large N, the sum Yli=o dominated by the i = k* term, and we get 

j2{j-i)k-^^*-^U2 {j-k*). 

i=0 


It follows that 

k 


1 

A 


E(j - i)Xi{t) < 2C8Ac(t)2^-"*(j - k*){^^\ 

i=0 ^ ^ 


(3-k*)(j-k*-l)/Gkf^ 


=-«h-7j) 


(6.39) 


for sufficiently large A. If j = A:* -|- 1 and A is sufficiently large, then K{t) = 1 -|- d, and so 


2C^K{t)2^-^\j-k*)[ ^ 


= 4(1 +,5)^8. 


(6.40) 


If j — k* > 2, then K{t) < k‘j^. For £ > 2, let 


Wi = 2 Csk% 2 H{ ^ 


r(r-i)/6fciv 


Then, for £ > 2, we have wi^xjwi < 3(/i/s)^/^*^^, which tends to zero as A —oo by the argument 
following (j6.15p . Therefore, for sufficiently large A, the i = 2 term is largest, so if j > k* + 2, 
then 


/ ii \ 0-fc*)0-fc*-l)/6fciv / \ l/3fcjv 

2C8^(A)2^-'^*(j - k*)i^^j < 16Csk%(^^j 


(6.41) 
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which tends to zero as —>■ oo by the argument around (I6.37p . Combining (I6.39P with the 

bounds in (16.401) and (16.4111 gives that for sufficiently large N, 


4 1^0 - i)X,(t) < SCse-C-’JI. (6.42) 

i=0 

The result now follows from (|6.30p . (|6.34l) . (|6.38l) . and (|6.42l) . □ 

Remark 6.9. If t G [t*, 7^+1] n [0, a^T], then on the event that Co = cc and Ci A Cs > h follows 
from (|6.32p and (|6.33p that 


CXD 

i=j+l 



where we get s/Nfj, in place of Js/Nfj, for the second term from the argument in the proof of 
Lemma 16.21 that there can be at most one value of i for which Tj+i > t but Xi(t) > 0. Likewise, 
if t G [7j,7j_|_x] n [0, ajsfT], then on the event that Co = cc and Ci A Cs > L equations (j6.38p and 
(I6.42P imply that 

i=0 

In particular, for t G [ 7 j, 7 j+i), unless t is close to jj or 7 j+i, nearly all individuals in the 
population at time t will be of type j. 

Proposition 16.101 below establishes the fourth part of Proposition 13.51 Part 1 of Proposition 
13.81 follows immediately from Propositions 16.3116.5116.81 and 16.101 


Proposition 6.10. For sufficiently large N, if t ^ [Tj,Tj^i) for some j > k* + 1, then on the 
event that Co = 00 and Ci A Cs > t, we have M{t) < j — 1. 

Proof. Suppose Co = 00 and Ci A Cs > L Suppose t G [tj, Tj+i), where j > k* + 1. We consider 
three cases. First, suppose t < on- Then M(t) < 3 < j — 1 by Proposition 16.31 for sufficiently 
large N. 

Second, suppose t G {aN,^k*+i)- Then M{f) < A:Ar + C '4 for sufficiently large N by Proposition 
16.51 Because t > Csi the result of part 1 of Proposition 13.61 implies that Tk*+i < 2aN/kN- 
Therefore, (|3.26|) implies that for sufficiently large N, 


2ajv 2 a 1 ^ 


kN 


(Pi) 


< ON- 


Therefore, Tj+i > ajy > which means j > k* + k]\f/3. For sufficiently large N, we are 

guaranteed k^ + C 4 < k* + kN/3 — 1, and thus M{t) < j — 1. 

Finally, suppose t G [ 7 ^, 7 ^+ 1 ) for some £ > k* + 1. Then M(t) < i + 2 C 5 for sufficiently large 
N by Proposition 16.81 Also, since t > 'ji = Tg + , equation (|3.26l) gives 

2 a]\i kpf 

'T+fe^/2 <r£ + — -^ ^ < "B'+i’ 

which means j > i + kj^/2 — 1. Since i + 2 C 5 < i + kjsf/2 — 2 for sufficiently large N, we again 
obtain M{t) < j — 1. □ 
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7 Proof of part 2 of Proposition 13.8 

Recall that Proposition 13.61 consists of three parts. The first part simply bounds Tk*+i- The 
second part is concerned with which can be interpreted as the number of new types that 

have emerged between times a^it — 1) and a^t. The third part pertains to the spacings between 
the times Tj. 

The time Cs is the first time at which one of the statements of Proposition 13.61 fails to hold. 
Part 2 of Proposition l3.8l stipulates that Cs can not happen until either or Q has occurred. That 
is, as long as the behavior of the type j individuals follows the description in Propositions 13.ll and 
13.21 and the mean number of mutations in the population behaves as described in Proposition 
13.51 the results of Proposition 13.61 must continue to hold. Part 2 of Proposition 13.81 like part 
1, is a deterministic statement. To prove it, we will assume that Co = oo. We will fix a time 
f E [t*, a^T] and show that if Ci > t and C 2 > t-, then Ca > t, which means that the conclusions 
of Proposition 13.61 are valid through time t. 


7.1 An upper bound on Tk*+i 

In this subsection, we establish the following result, which gives part 1 of Proposition 13.61 

Proposition 7.1. For sufficiently large N, on the event that Co = oo, Ci > ‘^cln/I^n, and 
C 2 > ‘^CLN/kN, we have Tk*+i < 2a]\f/k]\f. 

Proof. Suppose Co = 00 , Ci > 2a7v/^Ar, and C 2 > 2aj\f/kj\f. We need to show Xk*{2aN/k^) > s/fj,. 

By dan, 

Xk42aN/kN) > (1 - (7.1) 

Because C 2 > 2aN/kN, we have M{v) dv < 3/s by part 1 of Proposition 13.51 Therefore, 

since 2fj.aN/kN —>■ 0 as A —)• 00 by (11.81) . for sufficiently large N we have 

|^2aN/kN r'ia.M/kN 

/ Gk*iv) dv = sk*{2aN/kN — t*) — fi(2a]\f/kN — t*) — / sM{v)dv. 

Jt* Jt* 

>sk*{2aN/kN-t*)-A. (7.2) 

Also, by Proposition 13.11 if we set d = 0 when k* < kf^ and d = dk* when k* > kff, we get 

Xk*{t*) > min{(l - 6),Ci}k~'^Xk*{t*). (7.3) 

Combining (|7.1P ' f|7.2P ' and ()7.3p . we see that there is a constant c > 0 such that 


cN fjJ^ 

Xk*{2aN/kN) > 


gSt* _ 
est* 


j —d 2sfc*ajv/fcjv 


Because (1 — e ^**)^* ^ i as A ^ oo, to show that Xk*{2aN/k^) > s/^ for sufficiently large A, 
it suffices to show that 


lim 

N^OO 


^h- 2sfc*a^f/fc/v 

Sk^+lk*\ N 


= OO. 


(7.4) 
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Arguing as in ()5.48l) . we get 


log -k*-l) log - k* log k* + k* + + oikN) 

= (^kN - k* -1 + log - k* log k* + k* + o{kN)- (7.5) 

Because k*/k^ —)> 1 as iV —)> oo, and A:Ar — A:* > /sat — A:)^ —)• 0 as —)• oo by (j3.5[) . the first 

term on the right-hand side of (17.5p is at least (1/2) log(s//u) for sufficiently large N. Because 

[kN log /cat)/ log(s//i) —0 as A" ^ oo by assumption A2, it follows that the first term dominates 
the right-hand side of (I7.5|] . and thus the expression in (j7.5p tends to infinity as N ^ oo. Hence, 
(j7.4p holds, which completes the proof. □ 

7.2 Approximating R{aNt)/kN by q{t) 

In this subsection, we establish the second part of Proposition 13.61 which states that R{aNt)/k^ 
can be well approximated by q{t), where q is the function defined in (I1.13p . The first lemma 
collects some properties of the function q. 

Lemma 7.2. There is a unique bounded function q : [0, oo) ^ [0, oo) satisfying The 

function q is right continuous on [0,oo) and continuous on [0,1) U (l,oo). Also, 1 < q{t) < e for 
all t > 0 and 

lim q{t) = 2. (7.6) 

t^OO 

Proof. Note that (jl.l3p is equivalent to the renewal equation 

Q{t) = g{t) + [ q{t- u)f{u) du, (7.7) 

Jo 

where f{u) = g{u) = l{o<u<i}- That this equation has a unique solution which is nonnegative 
and bounded on every finite interval is a consequence of Theorem 2 in m- Another consequence 
of Theorem 2 in [TO] is that the function t i—)• q{t) — g{t) is continuous, which implies that q is 
right continuous on [0,oo) and continuous on [0,1) U (l,oo). 

To obtain the bounds on q, let u = inf{i > 1 : q{u) > e or q{u) < 1}. Suppose u < oo. Then 
either q{u) = e or q{u) = 1. However, q{u) = q{t) dt G (1, e), a contradiction. Thus u = oo, 
which means 1 < q{t) < e for all t > 0. Equation ()7.6I) is a consequence of Theorem 4 in [lOj . 
See also Remark 01 □ 

The next lemma controls the value of R{t) for t < a^. 

Lemma 7.3. Let 0 < r] < 1. If N is sufficiently large, then for all t € [0, oat), on the event that 
Co = oo, Cl > L o,f^d C 2 > t, we have 

(1 - < R{t) < (1 + r?)A:Are(^+^)'/“^. (7.8) 

Proof. On the event Co = 00 , Proposition 13.II implies that tj > t* for all j > A:* -|- 1 and therefore 
R{t) = k* for t G [0,t*]. Because k*/kN —)• 1 as A —>• 00 and t* /on ^ 0 as A —)• 00 by (II.7p . it 
follows that for sufficiently large A, equation (17.8p holds for all t G [0,t*]. 
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Consider next the case in which t* < t < oat. Suppose also that Co = cc, Ci > t, and C 2 > t. 
Let 9 > 0. If k* + 1 < i < J and r^+i < t, then Lemma l6.II implies that for sufficiently large N, 




< eS-r dv ^ ft 


2s 

^ ’ 


(7.9) 


Note that 


rTt+i t"^t+i 

/ Gi{v)dv = si{Ti+i-Te)-s M{v) dv - fi{Ti+i - Ti). (7.10) 

Jri Jri 

Note that > t* by parts 3 and 4 of Proposition 13.11 Therefore, because C 2 > t, part 1 of 
Proposition 13.51 implies that 

fTl + l fO-N fO-N 

0<s M{v)dv<s / M{v)dv<3s / < 3. (7-11) 

J Tfi J t* J t* 


Since 2e^ < 41 and fiajsf —>• 0 as N" —>■ 00 , it follows from ()7.9p . ()7.10p . and ()7.1ip that for 
sufficiently large N, 


Cefi 


< Qs£(Tt+i-n) ^ 


41s 

d 


Therefore, for sufficiently large N, 


{l-e)aN 1-6*, fs\ /l + ^i f 

- i -= — j - \J‘) 


(1 + 9)aN 


(7.12) 


Furthermore, by repeating the above argument with t in place of Tj+i, we see that for sufficiently 
large N, if < t and t — T£ > {1 + 9)aN/^, then 


Gi{v) dv > {1 + 6)a]sfS — 3 — fiajsf > log 


'Tt 


(!)■ 


(7.13) 


in which case the last statement of Lemma l6.II implies that r^+i < t. 

Therefore, if fe* + 1 < j < J and tj < t, then (j7.12p implies that for sufficiently large N, 






1 


t>Tj> ^ (rf+i - Te) > (1 - 9)aN j > «Af(l - ^) log 

£=k*+i e=k*+i ^ 


J 


/c* + 1 / ’ 


and rearranging this equation gives j < {k* + In view of (|3.23p . it follows that for 

sufficiently large N, 

R(*)<(r + l)exp(^-^). (7.14) 

Likewise, equation (17.121) and Proposition 17.11 imply that if k* + 1 < j < J and tj < t, then for 
sufficiently large N, 




Tj = Tk *+1 + ^ (r^+i - Tl) < + (1 + 9)aN ^ ^ < OAT f ^ + (1 + 0) log 




1 


/ 2 


kN 

£=k* + l l=k* + l 

and the observation following (I7.13P thus implies that if 


\kN 


j - 1 


i > OTV ( ^ + (1 + 6*) log 


4-1 
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or equivalently if j < 1 + A:* exp([t/a 7 v — 2/A:Ar]/(1 + 6 )), then tj < t if iV is sufficiently large. It 
follows that 

R{t) > 1 + k* exp (---— — -— -— ] (7.15) 

V«iv(l + 0) kN{l + 0 )J 

for sufficiently large N. Because /cat —>■ oo and k*/k]\[ —>■ 0 as N —)• oo, we can see from (17.1411 
and (j7.15ll that for sufficiently large N, equation (I7.8|) holds for all t E as long as 9 is 

chosen to be sufficiently small relative to ij. □ 

We next consider the value of R{t) for t E [a]sf,aNT]. We will find it useful to introduce the 
following notation. For t E [0, A uatT), let 


M{t) 


0 if t < Oat 

k* if t E [aAr,7fc*+i) 

j if f E [ 7 j, 7 j+i) for j > k* + 1. 


(7.16) 


Note that M{t) is well-defined because, by Remark 13.41 we have Tj < and therefore 7 j < 

7 j+i, whenever Tj < Cl- As long as the conclusions of Proposition 13.51 hold. M{t) is a good 
approximation to the mean number of mutations in the population at time t. 


Lemma 7.4. If C 2 > cln, then 

\M(t) — M(t) \ dt < -. 

s 

For sufficiently large N, if C ,2 > 7fc*-i-i and Ci > 2aN/k]\f, then 



pk 

J ajsf 


7fc*+i _ 2k* 

\M{t) — M(t)\ dt < - — ajv- 
kN 


Finally, for all j > k* + 1, i/ C 2 > 7i+i then 

r/j+i 


\M{t)-M{t)\dt < 


2C5 


'ij 


(7.17) 


Proof. The first and third statements follow immediately from integrating the result of Proposi¬ 
tion 13.51 For the second statement, note that for sufficiently large H, we have kj^ -t- C 4 . ^ 2k . 
Then for t E [dAf)7fc*+i)) h follows that when C 2 > 7fc*-i-i) we have 0 < M(t) < 2k* and thus 
\M{f) — M{t)\ < k*. The result follows because when Ci > 2aN/kN and C 2 > 7fe*-i-i) we have 
7fc*+i — Oat = Tfc*+i < 2aN/kN by Proposition 17.11 □ 


Lemma 7.5. Suppose j > k* + 1. Also, suppose t E [Tj,Tj+i) and either Ci > t or C 3 > t. Then 
R{t)=j-M{t). 

Proof. First suppose that t > 7fc»+i, so that t — on > Tk*+i. Then M{t) = i implies that 
t E [ 7 £, 7 £+i), and thus t — ajy £ [T£,Tg^i). Thus, in view of Remark 13.41 when Ci > A or (13.2611 
when Cs > t, the times Ti+i,Ti+ 2 , ■ ■ ■ occur in the interval {t — aN,t]. Because R{t) is the 
number of integers i > fc* -|- 1 such that t — on < U < t, we have R{t) = j — I, as claimed. 
The other possibility is that t < 7 fc*+i. Because t — aw < Tk*+i, the times Tk*+i, ■ ■ ■ ,Tj occur in 
the interval {t — a]\f,t]. Therefore R{t) = j — k* A t > aw and R{t) = j if t < aw, which again 
matches the conclusion of the lemma in view of (j7.16p . □ 
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The lemma below is the key to obtaining the integral equation for the limit function q. 


Lemma 7.6. Let 0 < rj < 1. If N is sufficiently large, then for all t E [a]\f,ai\fT], on the event 
that (q = oo, > t, and (2 > t, we have 

[ R{u) du < R{t) < [ R{u) du 

Jt—aN Jt—aN 


provided that 

< inf R{u) < sup R{u) < Skjsf- (7-18) 

2 ttG[0,t] uG[0,t] 

Proof. Fix t E [a^v, ajyT], and suppose Co = 00 , Ci > t, and C 2 > t. Let Li = min{j : tj > t — oat} 
and L 2 = max{j : Tj < t}. In view of Remark 13.41 we can write 


{t - aN,t] = {t- aN,TLf\ U 



U {TL2,t]. 


For u E [0, t], let 


c/ f 0 if u < Tk*+i 

^ \ Gj{u)/s if t E [rjjTj+i) for j >k* + 1. 

If Li < j < L 2 , then since j < J by Remark 13.71 Lemma l6 .1 1 implies that for sufficiently large N, 


Dividing by s, we get that for sufficiently large N, 

^l-|^aAr< j 5(tt) du < (7.19) 

Because C 2 > t, we have S{u) >0 for all u E [0,t) by the result of part 4 of Proposition 13.51 
Combining this observation with the last statement of Lemma 16.II we get 

0<y S{u)du = -j Gj(u) du < - log ^ < ^I + Oat (7.20) 


for sufficiently large N. Likewise, if Li = k* + 1, then S{u) = 0 for u < and if Li> k* + 1, 
then 

f S{u) du< f 

Jt—aN ^ 

Therefore, Lemma l6.II implies that for sufficiently large N, 

0 < S{u) du<^ log < ^1 + oat. (7.21) 

By Remark 13.41 the times rx,^,r^^+i,...,rxj are in (t — aj\f,f\, so R{t) = L 2 — Li + 1. Therefore, 
we can sum (j7.19p over j from Li to L 2 ~ 1 and combine this result with (I7.20p . and (|7.21l) to 
get that for sufficiently large N, 


{R{t) - 1) OAT < y S{u) du < {R{t) + 1) 


1 + 


ON- 
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Rearranging this equation, we get, for sufficiently large N, 

rt I 

S{u) du < R{t) < 1 + -Tj- 

t—a^ 


1 + /-I , - / S{u) du < R{t) < 1 +- -- f S{u)du. (7.22) 

(1 + r?/3)ajv Jt-ai, (1 - dl^)aN Jt-a^ 


We now relate S{u) to R{u). By Lemma 1731 if u E [rj,rj_|_i) n [0,t] with j > k* + 1, then 
R{u) = j — M{u). Therefore, for u E [Tj,Tj+i) n [0,t], 


S{u) = 


Gj{u) 


= (j - Miu)) + (M(u) - Miu)) -- = Riu) + (M(u) - M(u)) - 
s s s 


If 0 < t < Tfc*+i, then S{u) = 0 and R{u) = k*. Therefore, 


f |*S'(n) — R(n)| du < k*Tk*+i + 
J t —ajv 


I \M{u) — M{u)\du +—qn- 


(7.23) 


By Proposition 17.11 for snfficiently large N, 


k Tk*+i < — ajv- 

kN 


(7.24) 


The nnmber of values of 'yi between t — aN and t is the same as the number of values of ti between 
t — 2aN and t — a^, which is either R{t — ajy) or R{t — ajy) — k* depending on the value of t. 
This means that at most R{t — qn) + 1 intervals of the form [ 7 f, 7 f_|_i) can intersect the interval 
[t — a]sf,t]. Therefore, by Lemma 17.41 for sufficiently large N 


L 


3 2k* 2 Ck 

\M{u) — M{u)\ du < —h -—UN H- {R(t — Oat) + 1). 

(t-ajv)VTj,*+i S kN s 


(7.25) 


Therefore, combining (I7.23p . (17.241) . and (|7.25p . we get that for sufficiently large N, 

rt Q Ah* an ,, 

/ \S{u) - R{u)\ du< - + - — aN H- -[R{t - aN) + 1) + —cln- 

kN S S 

Therefore, if (I7.18P holds, then for sufficiently large N, 


— f |<S'(tt) — R{u)\ du < 

ttN J t—aff 


3 + 2C5{3kN + l) f4:k* // 


SUN 


+ 


kN s 


Because soat —>■ oo by (|1.8I) . it follows that for sufficiently large N, when (17.181) holds we have 


1 


■ f |<S'(u) — R(u)| du < ^kN < [ R{u) du. (7.26) 

Jt—aiv 6 3aN Jt—aja 


Jt—aff 3aN Jt—apf 

From (|7.22p and (j7.26p . we conclude that for sufficiently large A^, when ()7.18p holds we have 


-1 + 


1-7/3 


R{u) du < R{t) < 1 + 


1 + 7/3 


R{u) du. 


(1 + r]/3)aN . 

’ t —(1 - r]/3)aN J 

t —(2TV 

The result follows since 1 —7 < (1 — 7 / 3 )/(l + 7 / 3 ) < (l + 7 / 3 )/(l — 7 / 3 ) < I + 7 if 0 < 7 < 1. □ 
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The following deterministic result will help us to obtain the second part of Proposition 13.61 
from Lemmas 17.31 and 17.61 

Lemma 7.7. Let 0 < r] < 1. Suppose r : [0,T] —M zs a right continuous function such 
that (1 — < r{t) < (1 + for 0 < t < 1 and (1 — rj) r{u) du < r{t) < 

(1 + r]) r(u) du for 1 <t <T. Let q be the function defined in Then 

sup |r(t) — q{t)\ < 
ze[o,T] 


Proof. Let ri : [0, T] —)• [0, oo) and r 2 : [0, T] —)• [0, oo) be the unique bounded functions satisfying 


(1 — r])e^^ if 0 < t < 1 J (1 + if 0 < t < 1 

(1 — 1 ]) ri{u) du if 1 < t < T, ^ | (1 + zy) r2{u) du if 1 < t < T. 


The existence and uniqueness of these functions, and their continuity away from 1, follows from 
Theorem 2 in |10] as in the proof of Lemma 17.21 because the functions ri and r 2 satisfy (|7.7jl if 
we replace the functions / and g by fi and gi or /2 and <72 respectively, where fi{u) = gi{u) = 
(1 - ry)l{o<„<i} and / 2 (u) = g 2 {u) = (1 + ?y)l{o<n<i}- 

We claim that ri(t) < r{t) < r 2 {t) and ri(t) < q{t) < r 2 {t) for all t G [0,T]. To see this, 
let u = inf{t : r(t) > r 2 {t)}. Seeking a contradiction, suppose u < T. Clearly n > 1, and so 
r 2 {u) — r{u) > (1 + ry) — fit)) dt > 0, which contradicts the right continuity of r and r 2 . 

Therefore, r(t) < r 2 {t) for all t € [0,T]. A parallel argument gives r{t) > ri(t) for all t G [0,T]. 
The result for g is a special case of the result for r, which completes the proof of the claim. 

Let d{t) = r2{t) — ri{t) for all t G [0,T]. The claim above implies that 


sup \r{t) — q{t)\ < sup d{t). (7.27) 

tG[o,r] ie[o,r] 


We have d{t) = (1 + “ (1 “ r])e^^ for t G [0,1]. Note that if t G [0,1], then 

d{t) < d{l) < + 2rye^+^ < 4rye^+T (7.28) 


If 1 < t < T, then 

d{u)du + 2ri / ri{u) du. 

1 Jt-i 

Therefore, using that 0 < ri(t) < q{t) < e for all t by Lemma 17.21 we see that if 1 < t < T, then 
d'{t) = (1 + ?y)(d(t) - d{t - 1)) + 2 r]{ri{t) - ri{t - 1)) < (1 + ry)d(t) + 2ery. 


d{t) = {l+r]) f r 2 {u) du - {1 - rj) f ri{u) du = {1 + g) f 

Jt-i Jt-i Jt- 


Solutions to the differential equation f'{t) = (1 + ?y)/(t) + 2ery can be expressed in the form f{t) = 
—2e?y/(l+7y), where C is a constant. If /(I) = d{l), then C = {d{l)+2eg/{l+g))e~^^~^'^'>. 
Therefore, if 1 < t < T, then 

d{t) < < 4rye(^+^)‘. (7.29) 

The result follows from (I7.27p . ()7.28ll . and (j7.29l) . □ 
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Proposition 7.8. For sufficiently large N, on the event that Co = oo, we have 


R{aNt) 

kN 


(lit) 


< 6 


for all t G [0,T] such that Ci > Oivt and (2 > OAfi- 

Proof. Suppose that Co = cc, Ci > ajyt, and C 2 > a]\ft. Choose r] > 0 small enough that 
4 jyg(i+r))r ^ Pqj, ^ g [0,T], let r{u) = R{aNu)/kN. Lemma [TlHl implies that if u < 1 and u <t, 
then (1 — ri)e^^~^'^^ < r{u) < (1 + . Define k = mf{ti : r(n) > 3 or r{u) < 1/2}. By 

Lemma EH if 1 < w < k and u <t, then 

pu ru 

{1 — T]) / r{v) dv < r{u) < {1 + 7 ]) / r{v)dv. 

J U—1 J U—1 

Note that R is right continuous, and therefore so is r, so we can apply Lemma lT.TI to the function 
r to get 

sup |r(rt) — q{u)\ < 6 . (7.30) 

nG[0,t]n[0,K) 

The result will follow from (|7.3UI) if we can establish that k > t. In view of Remark 13.41 we have 
\R{u) — R{u—)\ G {—1,0,1} for all rt G [0, awt]- In particular, if k < f, then |r(K)—r(K—)| < l/Zcw, 
which contradicts (I7.30j) for sufficiently large N because 1 < q{u) < e for all u > 0 by Lemma 
17.21 Therefore, k > t, and the proof is complete. □ 


7.3 The spacings between tj and rj+i 

The third part of Proposition 13.61 primarily pertains to the spacings between Tj and Tj+i. The 
proposition below establishes the necessary relationship between the times Tj and the function q, 
and leads easily to the main result (I3.26h . 

Proposition 7.9. If N is sufficiently large, then for all j G {k* +1,..., J — 1} such that Co = 00 , 
Cl > Tj+i, C2 > 'Cj+ij D+i — (^nT, we have 

fTj+l/aN 1 + 25 

/ qiu) du < (7.31) 

Jr+aN kN 

and 

fTj+l/aN 1-25 

/ (9(«) + l{ne[1.7fe*+i/aiv)}) ^ -7-• (7-32) 

J i~j j a fsf 2 * 

Also, if N is sufficiently large, then for all j G {k* + 1,..., J — 1} and all t G [0, onT], on the 
event that = 00 , Ci > t, and C 2 > t, if 



q{u) du > 


1 + 25 
kN 


(7.33) 


then Tj+i < t. 
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Proof. We prove the result by induction on j. Suppose Co = oo, Ci > Tj+i^ C 2 > ^j+i^ and 
Tj+i < ajyT. Suppose also that (I7.3ip and (17.321) hold with i in place of j for t E {/c* + l,..., j — 1}. 
Let r] > 0. From (j7.19p we see that if N is sufficiently large, then 


(1 



dv < (1 + r])aN. 


By Lemma 1731 for v E [Tj,Tj+i), 


Gj{v) 


j - M{v) 


s 


R(v) + (M(v) - M(v)) - 

s 


(7.34) 


(7.35) 


Let Lj be the number of integers i > k* + 1 such that j£ E [rj, Tj+i). Then the interval [r^, Tj+i) 
intersects at most Lj + 1 intervals of the form [ 7 £_i, 7 £), so by Lemma 17.41 if N is sufficiently 
large, then 

f \M{v) - dv <- + ^ (7.36) 

Jrj S S 

By the induction hypothesis, (|7.32p holds if j is replaced by £ E {k* + 1,..., j — 1}. By Lemma 
17.21 we have q{u) < e for all u > 0. Also, 7 fc*+i/a 7 v = 1 + Tk*+i/aN < 1 + 2/k]\[ for sufficiently 
large N by Lemma l7.ll Since q is right continuous and q{l) = e—1, it follows that for sufficiently 
large N, we have 

sup {q{u) + l{ne[i, 7 fc*+i/aiv)}) <e + 6. (7.37) 

u>0 

Thus, using (13.ip and (I7.32h . for sufficiently large N, 


7^+1 -li = n+i - Tl> 


aAr(l — 25) 
kN{e + 5) 


QAf 

3k N 


It follows that Lj < 1 + {3kM / aN){Tj+i — Tj). Combining this observation with (I7.36h gives 


^ M. .. /3 + 4C5 , (iSkNC^\ 

\M{v) - dv < ---+ j 


V7+1 


Write C = 3 + Because kN/{aNs) —>• 0 as A" ^ 00 by (11.71) and ^/s ^ 0 as A —>■ 00 , it 
follows that for sufficiently large A, 


CTj + l 


\M{v) - + ^') dv<^+ ^('^j+i “ Tj). (7.38) 

O / O 


Combining (j7.38l) with (17.341) and p7.35p . we get that for sufficiently large A, 


/ Wv) + {M{v) - M(u))l{^g[„^_.^^.^^)}) dv - ajv 
J Ti 


C 

< rjUN + — + viTj+i - Tj). 


To simplify notation, write h{v) = {M{v)—M{v))l^y^[aM,'yk*+i)}' the substitution u = v/qn 

and divide both sides by a^kN to get 


^Tj+i/ajv / ^ h{aNu) \ 

Tj/aN \ kN J 


1 

kN 


< J-+ ^ + 7('^i+l - Tj) 

~ kN SUNkN CLNkN 
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for sufficiently large N. By Proposition 17.81 we have \R{aNu)/kN — q{u)\ < S for u < Tj^i/aN, 
so for sufficiently large N, 


r^j+i/o-N 
Tj I aj^ 


h{aNu)\ 1 




kj^ saNkj\f 




ttNkN 


UN 


We now pursue the upper and lower bounds separately. In view of part 2 of Proposition 13.51 
because (2 > p/'+i) have h{v) > k* — kjq — for all v E [o 7 v, 7 fc*+i) H [Tj,rj+i). Therefore, 
because k*/k]\f —>■ 1 as —)• 00 and 7 fc*+i — = Tk*+i < 2aN/kN by Proposition 17.11 for 

sufficiently large N we have 


/ 


Tj + l/o-N 


Tjjaiq 


h{aNu) 

kN 


du > 


k* - kN - C4 \ / 7fc*+i - aAf \ ^_ 'n_ 

kN J \ CLN J ~ kN 


Combining this result with (17.391) yields 


I'Tj+i/aM l + 2rj C — Tj) d{Tj + l — Tj) 

/ q{u) du < —- - +-— + dW+l-11 + ^J+1 - 

iTj/aN SttNkN ClNkN OAT 


(7.40) 


Since sun —>■ 00 , we have C/{saN) < fl for sufficiently large N. Therefore, bringing the last two 
terms on the right-hand side of (17.401) to the left-hand side, we get 


L 


Tj + l/o-N 
/ajv 




for sufficiently large N. Also, since q{u) > 1 for all rt > 0 by Lemma 17.21 we have q{u){l — a) < 
q{u) — a for all ti > 0 and a > 0. Therefore, for sufficiently large N, 


L 


'Tj+l/o.N 


Tjjapf 


q{u) du < [ 1 — - 5 


Vl + 37 

kN 


The upper bound (|7.3ip follows as long as rj is chosen to be small enough relative to 6 . 

To obtain (j7.32p . note that h{v) < k* for all v € [aAr,7A:*+i) H [Tj,Tj+i). Therefore, for 
sufficiently large N, 


L 


rj+l/ajv }i{aNU) 


/ajv 


CAT 


du < 


< 


f 


'Tj+l/O'N 


/“AT 
'Tj+l/o-N 




k* — kN \ (^k*+i — cln 


k 


N 


Qn 


m+ 1 /a.N 

/ ^{«G[l,7j,*+i/ajv)} 


Combining this result with (I7.39P and using that saN ^ 00 as N ^ 00 , we get for sufficiently 
large N, 


L 


Tj+i/aN 

{q{u) + l{.g[l, 7 ,.+,/a^)}) du > 

Tj/aM 


1-377 r]{Tj+l - Tj) 6{Tj+l - Tj) 


UNkN 


ttN 


and therefore 


/ 

J Tr 


Tj + l/o-N 


71 \ 1 _ 3/? 

q{u) + + ^ + S] du> . 
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If X > 1 and a > 0, then x(l + a) > x + a. Therefore, for sufficiently large N, 

The lower bound ()7.32p follows as long as rj is chosen to be small enough relative to 5. 

It remains to prove the last statement of the proposition. Suppose now that Cq = oo, Ci > t, 
C 2 t < ajyT, and (j7.33p holds. We need to show that Tj+i < t. By Lemma iBTl if N is large 
enough, it suffices to show that 

Sr. Gj(v) dv 2S 

Therefore, it suffices to show that for sufficiently large iV, 

[ dv > {1 + ri)a]s[. (7.41) 

Jr, S 


Using (I7.35|) . the bound in part 2 of Proposition 13.51 and the reasoning leading to (j7.38p with t 
in place of Tj+i, we get for sufficiently large N, 

f Gj{v) f dv - ^ - r]{t - Tj) - {k* -kN - C' 4 )( 7 fc+i “ o^n)- (7.42) 

J T4 ^ J T-i 


By Proposition 17.81 sufficiently large 

ct pt/a^^ 


/ 

J Td 


R{v) dv = ajsi 


jlo,N 


R{a]sru) du > a^k^ 


rt/a^ 

( 

rj / ajv 


{q{u) — 5) du. 


(7.43) 


Using Proposition 17.11 we have {k* — k^ — Ci){'^k*+i — o-n) < {k* — kj^ — C4){2aM/k^) < r/aw 
for sufficiently large N. Combining this bound with (|7.42l) and (17.431) . and then using (I7.33p . we 
get that for sufficiently large N, 


G,{v) 


dv > a^kN 


rt/o-N 

JrjlaN 


Qiu] 


V 


kN J s 


rjUN 


> ttNkN O “ 


rt/aN (J 

/ q{u) du - rjUN 

'TjlaN ^ 


>ajvfl-<5-^Vl + 2(I)-- 


kN J 


rjUN, 


which implies ()7.41ll as long as rj is chosen to be small enough relative to 6 , in view of the fact 
that sttN —00 as iV — 00 . □ 


Proof of part 2 of Provosition HOI Recall that q{u) > 1 for all u > 0 by Lemma YT72\ Therefore, 
if (I7.31h holds, then 

U’+i ~ U ^ 1 + 26 ^ 2 
ajv kN kN 

Also, in view of (I7.37p . for sufficiently large N, if (17.321) holds, then 

U+i - U > 1 - ^ 1 

On ~ (e + 6)kN ~ 3kN 

Thus, if (j7.31l) and (I7.32p hold, then so does (|3.26p . Also, if Tj ■ + 2aAr/feAr < unT, then (17.331) 
holds with t = Tj + 2aN/kN- Therefore, part 2 of Proposition 13.81 follows from Propositions 17.11 
EEl and[721 □ 
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8 Proof of part 3 of Proposition 13.8 

To prove part 3 of Proposition 13.81 we need to show that with high probability, the results of 
Propositions 13.21 and 13.31 hold as long as the results of Propositions [H3] and [SlUl hold. Propositions 
13.21 and 13.31 describe the behavior of the number of type j individuals. The proof proceeds by 
induction on j, in the sense that to show that the number of type j individuals behaves as 
predicted, we will need to know that the number of type j — 1 individuals does so. Define the 
stopping time 

Pj = Co A C2 A Cs A Ci,i-i A qnT. 

We then need to show that 


J 

P{{Co = 00} n {Ci,i < Pj}) < |. ( 8 . 1 ) 

j=0 

Essentially, this means that the number of type j individuals behaves as expected with high 
probability until after time pj. 

Note that if t < pj, then the reasoning in Remark 13.71 implies that no individual of type J + 1 
or higher can appear until after time t. Because assumption A3 implies that sk^ —^ 00, we have 
sJ < 1 for sufficiently large N. It follows that 1 + s{j — M{t)) > 0 for all j > 0, and therefore 
Gj{t) = Gj{t) for all j > 0 as noted in (14.6h . Throughout this section, we will assume that N 
is large enough that sJ < 1, which will make it possible to ignore the distinction between G}{t) 
and Gj{t). 


8.1 Individuals of type j<k* 

In this subsection, we consider the behavior of individuals of type j for j G {0,1,... , k*} and show 
that with high probability this behavior matches what is described in Proposition 13.21 Central to 
the analysis will be the martingales Zj’'^ from Corollary 14.31 with k = t* and r = {pj f\^k*+K)^t* . 
To lighten notation, we denote this process by Z'y We let p* = {pj A 7fc»+x) V t* and then, for 
t >t*, we let 






Z'j(t) = (8.2) 

Jt* 

Note that when j = 0, we are using the convention X-i{u) = 0. For t E 

Xj{t) = ‘^^Xj{t*) + [ pXj_i{u)e^^ du + '^’^Z'(t) 

Jt* 

~ r?',i(^)++r?',3(^)5 (8-3) 


where Tj^i{t), Tj^ 2 {t), and denote the three terms in the previous line. To establish the 

result of part 1 of Proposition 13.21 we need to show that |Tj, 2 (t) + Tj^ 3 {t)\/Tj^i{t) < 6 with high 
probability for t G [t*,p*]. We first bound Tj^ 2 {t)/Tj^i{t). 

Lemma 8.1. For sufficiently large N, if 1 < j < k*, then on {Co = cc}; 

M,-i(r) <5 

sXj{t*) 3' 
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Proof. Suppose Co = oo. By (|3.8I) . if j < kj.^, then 

nXj_i{t*) ^ //(I + <5) _ Xj-i{t*) ^ 1 +J _ j ^ l + <^ _ k* 
sXj{t*) ~ s(l — 5) Xj{t*) 1 — 5 — 1 ~ 1 — 5 /c^ — 1 

Suppose instead j G {kff,k'}^). Because —)• 0 as iV ^ oo by (13.5h . for sufficiently large 

N we know that j — 1 < kf^. For such N, because dj < 2, equation p.lOp yields 

nXj_i{t*) ^ fi{l + 6 ) _ Xj-i{t*) ^ 1 + <5 _ j ^ 1 + (^ _ k%k* 

sXj{t*) - Cis kff^Xjit*) <^ 1 ( 1 -J) - 1 ) “ Ci(l-J) k%-l' 

Because the right-hand sides of (18.41) and (18.511 tend to zero as —>■ 00 , the result follows. □ 

Lemma 8.2. For sufficiently large N, if 0 < j < k* and t G {t*,p*], then Tj^ 2 {t)/Tj^i{t) < bffi. 

Proof. Since ^ 0^2 (A) = 0, we may assume 1 < j < k*. Because Cij-i < Pj^ the conclusion of part 

1 of Proposition 13.21 holds for j — 1 up to time p*. Therefore, if n G {t*,p*), then 

Xj_i{u) < (1 + 5)Xj_i{t*)eft" dv_ 


It follows that if A G {t*,p*], then 

rt J 

Tj,2{t) < p{l + d)Xj_i{t*) / 

Jt* 

= /i(l + 5)Xj_i{t*)e^i* Gffi) dv f 

Jt* 

< Xj_i{t*)eJt* Gffi) 

Thus, if A G (A*,/?t]j then 

r,-2(A) ^ p{l + 5)X,_^{t*) ^ 

Tjp{t) ~ sXj{t*) 

The result now follows from Lemma l8.11 □ 


To bound Tj^ 3 (A)/Tj^i(A), we will need to control the fluctuations of the process (Zj(A), A > A*). 
The following preliminary bound will be useful. 


Lemma 8.3. For sufficiently large N, if 0 < j < k* and u G {t*,p*], then 

Gj{v) dv^ < w{u)e~^^^'^~^ \ 


exp 


where 


w{u) 


21 if u ^ (A*, OTv] 

{s/p)'^^’^/^ ifu>aN. 


( 8 . 6 ) 
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Proof. Note that 


- /j* Gj{v) dv _ sM{v) dv+^i{u-t*) 


e 


gjt* 


(8.7) 

In view of parts 1 and 3 of Proposition [3^ we have — < n^k*+K < n{aN + ‘^KaN/kisf) —)• 0 

asA^—)-oo. Iftt<aAr, then J)* M{v) dv < 3/s by Lemma 17.41 and therefore, for sufficiently large 

N, 

sM(v) dv+ii{u-t*) < g3+/.(n-t*) < 21 = yj(u). (8.8) 

Suppose instead qn < u < 'yk*+K- By the results of Propositions 13.51 and 13.61 

K-l 


fajv 


M(v) dv< M(v) dv + 


nk*+i 

'aN 


M{v) dv + 


r7fe*+f+i 


M{v) dv 


£—1 •'7fe*+« 


K-l 


< 


+ {kN + C^{lk*+i ~ O'n) + {k* + I + 2C'5)(7fc*_|_£_|_i — ^k*+t) 


£=1 


< 


2 ai\[ / ^k]\[ 
kj\f \ 2sajkf 


+ {kj\f + C*4) + (iP — 1)(A:* + iP + 2C^) 


Because K = [kf^/A\, we have K{kj\£ + K) < (5/16)A:^. Since the other terms are of a smaller 
order of magnitude for large N, it follows that there is a positive constant c < 1/3 such that 
3k]sf/{2saN) + (fciv + C 4 ) + {K — l){k* + K + 2 C 5 ) < ckf^ for sufficiently large N. Hence, for 
sufficiently large N, 

ru 

/ M{v) dv < 2caNkisf 

Jt* 


and therefore 


/(« sM{v) dv+£L(u—t*) ^ ^2saj^kj^/3 — ( f_ 


eJt 




2k]\^/3 


= w{u). 


The result follows from (j8.7p . (|8.8p . and ()8.9p . 

Lemma 8.4. For sufficiently large N, if 0 < j < k*, then 


(8.9) 

□ 


p( sup |Z'(t)| > ^-X,{t*)\ < 

Vteh*,p*] 2 J 


Gd/CAT 


Proof. The process {Z'At),t > t*) is a mean zero martingale. By Corollary 14.31 and (|5.5p . for 


t > r. 


Yai{Z'Jt)\FtA < E 


rt£^p*j 

/ e-2/t* gFv) + 3Xj{u)) du 

Jt* 




= E 


e- JJ* (^) {^e- JJ* Gj {v) dvx._Au) + 3e- (n)) du 


Ev 


. ( 8 . 10 ) 


For u < p*, the conclusion of part 1 of Proposition 13.21 holds for j — 1 through time u, and so 


g- If* Gj{v) dvx._^^u) < (1 + 5)Xj_i{t*)e^t* Gj-i(v) dv^- /“ GFv) 
= (l + <5)Xj_i(C)e-'*(“-‘‘). 


dv 


( 8 . 11 ) 
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Plugging this result and the result of Lemma 18.31 into (|8.10l) , and then bringing the conditional 
expectation inside the integral, we get for t > t*, 


Var(Z;.(t)|J-t*) 


< E 


— En J 


du 




Jt* 

Jt* ^ 

Because {Z'-{u),u > t*) is a martingale with = 0, we have E[Z'-{u)\Ft*] = 0 for u > t*. 

Using this fact along with (18.2p followed by (18.lip , we get for u > t*, 


E[ 


j ft* 


..uAp ■ 


{u<p*}\:Ft*] <E[e-^t* 

ruAp* 


= E 


< E 


ru/\p- 

/ pXj_i{r)e-^tfGEv)dv^^ 

Jt* 

ruApt 

pj + 6)Xj_i{t*) / 

Jt* 


+ X,it*) 


Tt 


t* 


+ Xjit') 




Thus, for t > t*, 


Yar{Z'{t)\Et*) < + 6)Xj_i{t*)(^l + ^) + du. 

By Lemma [8Tl for sufficiently large N we have p{l +6)Xj^i{t*){l + 3/s) < Xj{t*) on {^o = oo}. 
Therefore, for t > t*, if N is sufficiently large, then on {Co = oo} £ Tt*, 


YaT{Z'At)\Et*) <4:Xj{t*) [ du. 

Jt* 


( 8 . 12 ) 


When j = 0, we take N large enough that > 21, and then, using the bound that 

^-sj{u-t*) ^ equations (I8.12p and (|8.6p imply that on {Co = oo}. 


/ \ 2fcjv/3 

Var(Z'(t)|J)*) <4X,(U)Uj t. 


(8.13) 


When 1 < J < A:*, we break the integral in (|8.12p into two pieces and use (|8.6I) to get that on 
{Co = oo}. 


yar{Z'{t)\Et*)<4Xj{t*){21 



apf / Q \ 2fc]v/3 /-CO 


4Xj{t*) 


(Hi) 


e-sj{u-t*) i 

2 ^ ai /3 ^—sj(aN—t*) 


' poo 

J ajx 


S3 


21 + e 


sjt 


* S 


SJ 

—j+2A;jv/3 


(8.14) 
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Parts 1 and 3 of Proposition 13.61 imply that if ^k*+K < Csi then 


2 Kajq 3a]\i 

lk*+K = Tk*+K + OAT < —-h Oat < 


(8.15) 


In particular, we must have p* < 3a]\f/2. Combining this observation with the Maximum 
Inequality, we get that on {(^o = oo}. 


P 


( sup \Z'{t)\>^-X,it*)TtA 


<P sup \Z'^it)\ > 

\ iG[t*,3ajv/2] ^ 

4Var(Z'(3aAr/2)|J)*) 


Xr 


< 


(5X,(C)/2)2 

1024/cArVar(Z'(3a7v/2)|J't- 


64/cAr 


e5^Xj{t*)^ 


(8.16) 


If we can show that, on {^o = oo}, the second factor on the right-hand side of (|8.16p is less than 
one for sufficiently large N, the result will follow by taking expectations of both sides in (I8.16|) . 
We will assume that Co = oo and show that this factor tends to zero as N —>■ oo, uniformly in j. 

We consider separately the cases j = 0 and 1 < j < k*. Suppose first that j = 0. We have 
-^o(i*) > (1 “ by Proposition 13.11 so using (I8.13p . 


fcArVar(Zo(3aAr/2)|J)*) ^ 6kNaN f ^ 6kNaN f 

Xo{t*Y - Xo{t*)\p) - {l-5)N\p) 


(8.17) 


Note that 
log 


k^aN ( 

-[-) j=logt« + log 

= log kN + log 


-I- log log 
-I- log log 


— log N + log 


(;) 


(;) 


which tends to —oo as N ^ oo because (logA:Af)/(logN') —>■ 0 as N' —)• oo and because, by 
assumption Al, we have log(l/s)/log A" —0 and (loglog(s//i))/log A —)• 0 as A ^ oo. It 
follows that the expression in p8.17p tends to zero as A ^ oo. 

Next, suppose 1 < j < k*. Then, using ()8.14p . 


A:ArVar(Z'(3aAr/2)|At.) q/sAr 


X,{t 




< 


sjXj{t* 


21+ e 


sjt* 


—j+ 2 kMp 


(8.18) 


We will show that the two terms on the right-hand side of (I8.18|) each go to zero as A ^ oo. 
For the first term, we use Proposition 13.11 equation p5.27p . and the fact that \og{l/s)/k^ 0 

by assumption Al to get 


log 


kN 


sjXj{t* 


< log 
= log 


klrS^jl 


minlCi, 1 — 6 }sjNpl {e^^* — !)■? 

. — 7 ^) - log A -h j log -h log j! - j log(e^** - 1) 

minjCi,! - (5}sjy \pj 

log N + j log + j log j - j - sjt* + o(/cAr). (8.19) 
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If j < ~ log-/V + j log(s//x) < 0, and sjt* > 2j log/ cat. Therefore, the expression in 

(j8.19l) tends to —oo as N ^ oo. If instead j G {k^,k'^), then t* = (4/s) log/ cat, and we can we 
write j as in (133]) to get 

log + logN + bjkN log /cat + j log j - j - 4j log k^ + o{kN), 

which tends to —oo as N ^ oo because bj < 2 and (15.321) holds. Thus, the first term on the 
right-hand side of (I8.18|) tends to zero as N ^ oo. To bound the second term, we use (I8.19|) to 
get 

(/) ) = - log iV + ^ log + j logj - j + o(k^). 

= -g logiV+ /log j - j + o{kN), 

which tends to —oo as N ^ oo because (fcAr log A;Ar)/log —)• 0 as —)• oo. It follows that the 
right-hand side of ()8.18p tends to zero as N ^ oo. □ 

Proposition 8.5. For sufficiently large N, if 0 < j < k*, then 
P((l - 6)Xj{t*)eft* dv < Xj{t) < (1 + S)Xj{t*)e-^t* dv ^ ^ {t*,p*]) > 1 - 

D4A^/\r 

Proof. By (|8.3I) . we have 

(1 - 5)Xj{t*)e^l* < Xj{t) < (1 + 6 )Xj{t*)e-^t* (8.20) 

as long as Tj^ 2 {t)/Tj^i{t) < 5/2 and |rj^ 3 (t)|/Tj^i(t) = \Z'-{t)\/Xj{t*) < 5/2. Therefore, the result 
follows from Lemmas 18.21 and 18.41 □ 


Proposition 8.6. For sufficiently large N, if 0 < j < k*, then 

P{Xj{t) > k%Xj{t*)e^t*Gffi)dv ^ ^ {'yk*+K,Pj]) < (8.21) 

and 

P{Xj{t) > 0 for some t G [jk*+L, < 4^' 

Proof. Fix j G {0,1,... , k*} Assume for now that (I8.20p holds for all t G {t*,p*]. Then, on the 
event {'yk*+K < Pj}i for all £ G {1,..., j} we have 


A,_i(7A:*+i^) ^ + ^ (l + (5)A,_i(r)e-^fa*+^-**) 

Xiffik^+K) ~ {I - 5)Xi{t*)e'f^t*^*^’^ 

Note that on {'yk*+K < Pj}, the result ^3.261) implies that for sufficiently large N, 

* aiviK — 1) 14aAr 

7k*+K-'t > aN + Tk*+K -'’'k*+i > a.N-\ -• (8.24) 

3/CAr 13 
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(8.25) 


Combining (18.231) and (I8.24p with Lemma ISTTl we get that on { 7 ^*+;^ < Pj}, for sufficiently large 
N, 

^e-lilk^+K) ^ (1 + S)Ss I 4 gajy /13 ^ (1 + S)S / s^\ 

X^(7k*+K) -3(l-,5)/i 3(1-<5) Uy 

which tends to zero as —>• 00 . This means that, among individuals with j or fewer mutations, 
the fraction with J mutations at time 7 k*+K must tend to one as N —>■ 00 . Recalling that 
Sj(t) = Xoit) + Xi(t) + • • • + Xj{t), for sufficiently large N we have 


Sj{7k*+K) < -Xj{7k*+K) 


on the event { 7 k* +k < Pj}- 

By Proposition 14.61 and Remark 14.71 the process 


p('1k* + K+P'^Pj r \ j 

(e '’Sj{{7k*+K + t) ^ Pj), t > 0 ) 


is a nonnegative supermartingale. Therefore, 


P 


i 


— f* Gn (v) dv , . kfj , , 

sup e ^^k*+K Sj{t) > ^5j(7fc.+x) 


JR 


V t^('yk*+K, pA 

Combining this result with (|8.26p . we get 


■P( 'S'iW > for some t G {7k*+K,Pj\ 


'yk*+K ) — 


< 


A-2 ■ 

I^N 


^'rk*+K ) — 


(8.26) 


(8.27) 


vr- ( 8 - 28 ) 


Taking expectations of both sides of (j8.28p . and then using Proposition 18.51 along with the facts 
that Xj{t) < Sj{t) for all t > 0 and ej^Ak^ + 2 /A:^ < e/48/i;Ar for sufficiently large N, we obtain 

dMH). 

To get (I8.22p . observe that when the complement of the event in p8.27p holds and pj > 7k*+Li 
we have 


Sj{7k*+L) < ^Sj( 7 fc.+x)eJrfc‘+x 

For V € [7k*+K, Pj), the result of Proposition 13.51 implies that for sufficiently large N, 
Gj{v) = s{j - M{v)) -p< s{k* -{k* +K- 2 C 5 )) < - — . 


r'lk*+L , 


(8.29) 


(8.30) 


Also, the result of Proposition 13.61 implies that if pj > 7k*+L then for sufficienly large N, 

lk*+L - lk*+K > [L - K) ■ > —-—. 


(8.31) 


Also, Sj{7k*+K) < X, so combining (I8.29p . (18.301) . and p8.3ip . we get that when the complement 
of the event in p8.27p holds and pj > 7k*+L, for sufficiently large N, 


s,(7i-+n < 


-16fciv/15 


(8.32) 
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The logarithm of the right-hand side of (18.321) is 

log N - log 2 log /cat - log 2 = --3- log -h 2 log kN - log 2 

15 XU) 15 

which tends to —oo as N ^ oo. Thus, the right-hand side of (I8.32p tends to zero as N ^ oo and 
thus is guaranteed to be less than one if N is sufficiently large. Since Sj{'yk*+L) is an integer, 
it must be zero. Furthermore, if Sj{'Xk*+L) = 0, then Sj{t) = 0 for all t > 7 fc*+L, which implies 
that Xj{t) = 0 for all t > 'yk*+L- We can now conclude (18.220 . □ 

Remark 8.7. It follows immediately from Propositions 18.51 and 18.6] that if 0 < j < A:*, then for 
sufficiently large N, 


k* 

P{{Co = oo} n {Cij < Pj}) < {k* -h 1) r 

j=0 ^ 


+ 


+ 


64 / c 7 v 4:8k]\r 48kN 



8.2 Other type j individuals before time Tj+i 

For the rest of section [8l we assume that j € {fc* -|- 1,... , Jj. In this subsection, we focus on 
type j individuals that are not early, meaning they are descended from type j mutations that 
occurred after the time defined in (I3.16p . We will show that the claim in part 2 of Proposition 
13.31 holds with high probability. We will begin with three preliminary lemmas. 

Define the random set 

Recall from (|3.I5p that as long as qj > 1, we have qj = j — k^ j G Q and Qj = j — M{Tj) 
if j ^ 0. When j G 0, it will be difficult to bound Xj{t) as tightly as when j ^ 0, so we will 
structure the proof so that we can allow a larger probability of Ci,j < Pj when j G Q. Because 
the times Ti are spaced at least a^/^kj^ apart until time ^3 by Proposition 13.61 there can be at 
most 12 values of j for which tj < pj and j G Q. 

Lemma 8.8. There is a positive constant Cg for sufficienty large N, the following hold: 

1. If j ^ Q and t G [r^, Tj+i A pj), then s{qj — Cg) < Gj(t) < s{qj + Cg). 

2. If t G [Tj,Tj^i A Pj), then (1 — 26)skN < Gjft) < Cj{t) + p < {e + 26)skN. 

3. If Tj < Pj, then (1 — 25)1^ < qj < (e + 25)1^. 

Proof. First suppose t G [Tj,Tj^i A pj) and j ^ 0. In view of part 4 of Proposition 13.51 we have 
j — M{Tj) > 1 and therefore qj = j — M{Tj). Therefore, Gj{t) — sqj = s{M{Tj) — M{t)) — p, 
which means 

\Gj{t) - sqj\ < s(^\M{Tj) - M{Tj)\ + \M{Tj) - M{t)\ + \M{t) - M{t)\ + . (8.34) 

It follows from Proposition 13.51 that 

\M{u) — M{u)\ < max{3, 2 ( 75 } if tt ^ [ovi 7fc*-i-i)- (8.35) 
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The results of Proposition 13.61 imply that since j ^ 0, we have A pj) n [aN,'yk*+i) = 0- 

Also, because t — tj < 2a]\f/k]\f by the upper bound in (I3.26p . the lower bound in (13.261) implies 
that at most six of the times Tj can occur between times tj — ajsr and t — It thus follows from 
(I7.16P that \M{Tj) — M{t)\ < 6. Since p/s < 1 for sufficiently large N by (|1.8p . combining these 
observations with (j8.34p gives 


\Gj{t) — sqj\ < s{7 + 2max{3, 2 ( 75 }) 


which implies part 1 of the lemma. 

To prove part 2, we assume t E [Tj,Tj+i A pj) but no longer assume that j ^ 0. It follows 
from Lemma 17.51 that R{t) = j — M{t). Therefore, 


Gj{t) = sR{t) + s{M{t) — M{t)) — p. 
By part 2 of Proposition 13.61 and Lemma 17.21 we have 

A:7v(1 — < R{t) < kM{e + 5). 


(8.36) 


(8.37) 


Also, p/s ^ f) as N —)>ooby (11.81) . Therefore, if f ^ [oat, 7 fc*+i), then part 2 of the lemma follows 
from (I8.35p . ()8.36p . and (I8.37h . Now suppose instead that t E [oAr, 7 fc*+i). Proposition [T5] implies 
that k* — k]\f — G 4 < M{t) — M{t) < k*. Therefore, using ()8.36p and part 2 of Proposition 13.61 
again, we have 

sk]\fiq{t/aN) — 5) + s{k* — k^ — G 4 ) — p < Gj(t) < sk]\fiq{t/aN) + ^) + sk* — p. (8.38) 

Since 'jk^+i/aN ^ 1 as A" —)• 00 by part 1 of Proposition 13.61 and since q is a right continuous 
function with g(l) = e — 1 by Lemma 17.21 we have e — 1 — 5/2 < q(t/aN) < e — 1 + 5/2 for 
sufficiently large N. Part 2 of the lemma follows because k*/k^ —>■ 1 as A —00. 

Finally, we prove part 3. When j ^ 0, we have sqj = Gj {Tj) + p, so part 3 follows immediately 
from part 2. Suppose instead Tj < pj and j E 0, which means — /k^ < Tj < aN + 2,aN/kM- 

Since g'(l) = e, it follows from part 2 of Proposition 13.61 that kM{e — 25) < R{aj^ — 2aN/kN) < 

kN{e + 5 ) if A is sufficiently large. Therefore, in view of (I3.26p . we have k^ie — 25) < j < 

kN{e + (5) + 12 and thus kN{e — 1 — 25) < qj < kN{e — 1 + 5) + 12 if A is sufficiently large. 

Therefore, part 3 of the lemma holds in this case as well. □ 


Define 



(8.39) 


Lemma 8.9. For sufficiently large N, ifrj < pj, then Tj < f,- < (,j < t* and /^Tk^ ■ 

Proof. Because sk]\f —>■ 0 by assumption A3, for sufficiently large A we have log(l/(3sA;Ar)) > b. 
Whenever Tj < pj, Lemma f8.8l implies that qj < (e + 25)k]\f. Therefore, for sufficiently large 
A, we have Tj < ^“ < f,j. Also, because qj > (1 — 25)kN for sufficiently large A if Tj < pj, 
and because (jl.8p implies that log(s//x)/log(l/sfcAr) > log(s//i)/log(l/s) — 00 as A — 00 , for 
sufficiently large A we have 



(8.40) 


Therefore, t/ — fj > /{^Tkjq) if Tj < pj. 


□ 
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Lemma 8.10. For sufficiently large N, if Tj <t< o,nd t < pj, then 


(1 — 3(5)s ffGj-i(v)dv 


-e 1 




— — 


(1 + 3(5) S ff Gj — \{v) dv 


-e J 




Proof. If j > k* + 2, the result is immediate from (I3.20p . If instead j = k* + I, then by (13.1211 
when t = Tk*+i and the fact that s/p < Xk*{Tk*+i) < 1 + s/p, we get 




< 


(8.41) 


Because 1 — 3(5 < (1 — (5)/(l + (5) < (1 + (5)(1 + s/ p)/[{l — 5){s/ p)] <1 + 3(5 for sufficiently large 
N, another application of (13.120 gives the result. □ 

Recall that Xj^ 2 (t) denotes the nnmber of type j individuals at time t descended from an 
individual that acquired a type j mutation after time Then, using the notation of Corollary 
sa for t E [^j, Tj+i /\ Pj], we have 




X, 


, . Gj{v) dv 

, 2 {t) - J pXj_i{u)e du. 


Let 


Pj ~ "Tj+i Pj^ 


and for t>^j, let 

with the convention that Z'j{t) = 0 if pj < ^j. Then for t > ^j, we have 


Z'it)=zf^’^^+^\tAp,), 


Xjpit A Pj) = r''' pXj_i(u)ef^^'’' du + e4' 


(8.42) 




We will separately consider the two terms on the right-hand side of (I8.42p . Lemma 18.111 below 
gives the required bounds on the first term. 

Lemma 8.11. For sufficiently large N, we have 

rt 


pXj_i{u)e^'^^^^^'^^^ du<{l + 3d)e^^i 


for all t E {f,j-,Pj\ and 


J' pXj_i{u)e-f^^^^^^‘^^ du > (^1 


7(5 \ r* Gj(v) dv 
Y le ^ 


for all t E [r*,pj]. 
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Proof. Suppose t G [f.j,Pj]- By Lemma fS. 101 for sufficiently large N, 



efl Gj{v)dv < 



(8.43) 


Because > Tj, we have < 1, which gives the upper bound in the lemma. 

Now suppose t G [rt,pj]. The same argument that yields ()8.43p implies that for sufficiently 
large N, 


du>{l- 35)eA .) 


— e 






(8.44) 


Now if Tj < pj, then 

- Tj) = — log + —■ 

Qj \sqjj Qj 

For sufficiently large N, part 3 of Lemma [8.81 gives qj > (1 — 2S)k]\f when Tj < pj, which by 
assumption A1 implies that s{f,j — Tj) —)• 0 and therefore ^ 1 uniformly in j as —)• oo. 

Furthermore, if t > tJ, then < Q-^^N/^TkN q g^g jy ^ oo by ()1.7p . Consequently, the 

lower bound in the lemma follows from (I8.44F □ 


It remains to show that the second term on the right-hand side of (I8.42h is small. We know 
from Corollary 14.41 that the process {Z'{f^j + t),t > 0) is a mean zero martingale, so the problem is 
to control the fluctuations of this process. The next result gives the key second moment estimate. 


Lemma 8.12. For sufficiently large N, we have, for all t >0, 

Var(Z'(e,- + t)\F^,) < 5e4 ^ 

Proof. By Corollary 14.41 we have 


Var(Z'(e,+t)|T-e,) 

= E 


3 ^3 

fe+i)Ap_ 


- ^3 


' e {yiXj_i{u) + Bf'"^+^\u)Xj^ 2 {u) + Df’"^+^\u)Xj^ 2 {u)) du 


We now can use the reasoning leading to pS.Sp to get 


Y&i{Z'j{ij+t)\F^.)<E 


i. 


—2 (f Gj{v) dv 

e L 


{pXj-i{u) + 3Xj- 2 (u))l{„<p.} du 




Using Lemma fS. 101 we get that if u < pj, then 

= (1 + 3S)sp’i 


dv 




(8.46) 


(8.47) 
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Also, from (18.421) and (|8.47l) . if u < pj, then 

Xj^ 2 {u) = / pXj-i{w)e dw + Zj{u) 

J C-i 




e 


< (1 + 3<5)se^"/ J dw + Z'j{u) 

= (1 + 2 , 5 ) _ g-s(«-r,)) ^ 

Combining (18.461) . (j8.47j) . and (I8.48j) . and using that 3(1 + 35) < 4 by (13.ip . we get 


(8.48) 


Var(Z'(e,+t)|J-^^.)<4ii;[ 

L J 


- /“ G^{v) dv Giiv) dv ^^^-s{u-rj) 


e 


+ _ e-(“-.)) + Z'iu))l^^^-,^j du 


Tp. 


(8.49) 


By part 2 of Lemma 18.81 we have Gj{v) > (1 — 25)skpq for all v E \Tj,Pj), which means that for 
u> ij, we have 

G,{v) < g-(l-25)sfc^(n-^,)^ (g 5Q) 

Also, although Zj{u) can be negative, it can be seen from (I8.46p that the integrand in (I8.49p 
must be nonnegative so, in particular. 


J,/ Gj{v) dv ^ g-.fe-r,) _ g-s(«-r,)) ^ > q 


(8.51) 


for u E [Cj,Pj)- Because s < 1 for sufficiently large N, we see that se~^G-'rj) _ Q-s{u-Tj) jg 
increasing function of u. Also, Zj{u) = Z'-{pj) for all u > pj. Therefore, (I8.5ip holds for all 
u>ij- Thus, combining (I8.49P and (|8.50l) gives 


Var(Z'(e,+t)|J-^^.)<4L;[ f' 

- •'S 3 


-il-25)ski^(u-^j) Gj{v) dv j ^^-s(u-Tj) 




{se 


+ ')) + Z'jiu)) du 




(8.52) 


Every expression in the integrand in ()8.52p is -measurable except Z'-{u). Since {Z'> 0) 
is a mean zero martingale by Corollary 14.41 we can apply Fubini’s Theorem and then evaluate 
the conditional expectation in (I8.52p to get 

Var(Z'(e,+t)|J-g^.) 

< ^ (^g^-s(u-Tj) _|_ _ g-s(n-Tj)^ 

4 

Now for all u > ^j, 

_1_ g-^lSj—fj) _ g-^fw-T-j) < s + g-^lSj-T-dj-]^ _ g-sfw-Sd) 

< s -|- • s{u — ^j) 

< s{l + u-Cj), 
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so for sufficiently large N, 


Var(Z'(^j + f)| J-^,) < 4e4' ^ ^ 

< 4e4' r e-(i-25)^fcivy5^i + ^y 

Jo 


= 4eJ^j 


+ 


(1 — 26)skj\f ((1 — 25)skj^Y 


< 


5 g/T/ Gji-^) dv _ 


sklj' 


as claimed. □ 

Lemma 8.13. For sufficiently large N, if < pj, then 

fij 

/ Gj{v) dv > sqj{^j — Tj) — 6 . 

Jtj 

Proof. Suppose < pj. Consider first the case in which j ^ Q. Then for sufficiently large N, 
we have Gj{v) > s{qj — Cg) for v G by part 1 of Lemma [8?8l Therefore, 


rij 

/ Gjiv) 

J To 


dv > s{qj - Gg){^j - Tj) = sqjiij - Tj) - Ggs{^j - Tj), 


and the result follows because s{f,j — Tj) —0 as —>■ oo by the argument following (18.451) . 

Next, suppose j G 0, which means qj = j — kj^. Using (|8.40p . we get 

2oAr ^ c ^ t fn < 1 A 

" “ ly - V ^ 

By Proposition 13.51 if t < ajsf A pj then M{t) <3. If oat < t < 7 fc*+i A pj, then M{t) < /cat + 04 . 
In view of p.26p . if 7 fc*+i < t < {on + (2 + l/ST)aN/k^) A pj, then t < 7fc*+8 and therefore 
M{t) <k* + 7 + 2 C 5 . Combining the results for these three cases, there is a positive constant C 
such that if t G [Tj,fj], then M{t) < kjy + G. It follows that 




G, 


fio 

(u) dv> {s{j - kN - G) - p) dv = {s{qj - G) - p){ij - Tj), 
J To 


and the result follows because (Cs + p){fj — Tj) —)• 0 as iV —)• 00 . 
Lemma 8.14. For sufficiently large N, 


□ 


Pi e 


ff Gn(v)dv. . 5 P Giiv) dv , £ 

\z'^{t)\ < for all t G [fj,pj]j > 1 " 

Proof. By the Maximum Inequality and Lemma 18.121 

p( sup |Z'(ej + t)| > dv \ < ^e- 2 /'; G^v) dv . Var(z;.(ej + f)|T^,) 

\t>0 2 / 0 ^ t>0 

< 80 ifjGAv)dv 

d'^skff 


(8.53) 
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By Lemma 18.131 and part 3 of Lemma 18.81 if < pj then 


g-J-r/ GjW dv 


< 


= e<5e-; 


SQj < 3e ^sk]\f. 


Plugging this result into (18.5311 . then taking expectations and using (j3.14p and the fact that 
J < ATk^ for sufficiently large N, we get that for sufficiently large N, 



^ 240e-^ ^ 960e-^r 
- 5‘^kN - S^J 


e 


The lemma follows. 


□ 


Combining (j8.42l) with Lemmas 18.111 and 18.141 and then summing over j immediately yields 
the following corollary, which shows that the result of part 2 of Proposition 13.31 holds with high 
probability. 

Corollary 8.15. For sufficiently large N, 

pf{Xj^ 2 {t) < (1 — 4(5)e'^'^J for some t G [r*, pj]} 

j=k*+i ^ 

U {Xjp{t) > (1 + AS)e-^"i for some t G [fj,Pj]}^ < 


8.3 Early type j individuals before time Tj+i 

In this subsection, we continue to assume j G {fc* + l,..., J}. We consider early type j individuals, 
which are descended from type j mutations that occur at or before the time We will show 
that the claims of part 1 of Proposition 13.31 hold with high probability. Note that (13.181) involves 
a constant C 3 , which we will define to be 


C 3 


2046T 

e 


(8.54) 


We will assume throughout this section that N is large enough that the conclusions of Lemma 
18.91 hold. 

From part 1 of Proposition 13.31 we know that if j > k* + 2, then no early type j — 1 individual 
acquires a jth mutation until time Tj A pj A ajyT. In particular, no type j individual can appear 
until time A pj. This result is also true when j = k* + 1 if we define = t* because, 
according to Proposition 13.11 on {^0 = 00 }, no individuals of type A:* + 1 appear until after time 
t*. Therefore, using the notation from Corollary 14.41 in which denotes the number of 

type j individuals at time t descended from individuals that acquired a jth mutation during the 
time interval (rt, u], as long as < pj, we have 


Xj^iit) = xf^-^'"^\t) + \t) + xf^ 


We will consider these three processes separately. 


(8.55) 
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Lemma 8.16. Let {Z(t),t > 0) be a continuous-time birth and death process in which each 
individual independently dies at rate u > D and gives birth to a new individual at rate X > u. 
Assume that Z{0) = 1. Then 


Also, i/n E N, then 

P{8upZ(()>n) = ^E_^. (8.57) 


Proof. It is well-known (see section 5 of Chapter III in [T]) that the generating function for this 
process is 


F[s,t) = Y,P{Z{t) = k)s'^ 

k=0 


iy(s — 1) — e P —i^) 

X(s — 1) — — n) 


Because P(Z(t) > 0) = 1 — F(0,t), the result (|8.56p follows after some algebra. 

Also, at any given time, the probability that the next event is a birth is X/(X + n), while the 
probability that the next event is a death is n/(XFn). Therefore, (18.5711 follows from well-known 
results for asymmetric random walks (see, for example, section 3 of chapter 3 in [l3]). □ 


Lemma 8.17. Suppose k is an {Ft)t>o stopping time such that < k < and, with positive 
probability, a type j mutation occurs at time k. For sufficiently large N, the following hold: 


1. Given that a type j mutation occurs at time k, the probability that the number of type j 
descendants of this mutation exceeds {s/p.)^~^ before time pj is at most Ssk^- 

2. Given that a type j mutation occurs at time n, the probability that n -\- /^Tk^ < pj and 

at least one type j individual descended from this mutation is alive at time n + ajy/8Tk]\f 
is at most 3sk]\f. 


Proof. Suppose a type j mutation occurs at time n. By the reasoning leading to (14.ip . each 
type j descendant of the individual that gets this mutation gives birth at rate less than or 
equal to 1 -|- s{j — M{t)). Since s{j — M{f)) = Gj{f) -|- p, it follows from Lemma 18.81 that 
until time pj, the birth rate is at most A = 1 -|- (e -|- 25)sk^- As long as the number of type 
j individuals descended from this mutation is less than {s/p)^~^, the reasoning leading to ()4.2p 
implies that the rate at which each such individual either acquires a mutation or dies and gets 
replaced by an individual that is not a type j individual descended from this mutation is at least 
/i -|- 1 — {s/pY~^{ 1 -P s{j — M{t)))/N. Using Lemma (8^ and (jl.Op . we see that for sufficiently 
large N, this quantity is at least n = p + 1 — 5ski\f until time pj. Therefore, until time pj occurs 
or the number of type j individuals descended from this mutation reaches {s/p)^~^, the number 
of such individuals is dominated by a continuous-time branching process in which each individual 
gives birth at rate A and dies at rate n. 

By Lemma 18.161 the probability that the number of type j individuals descended from this 
mutation exceeds {s/p)^~^ before time pj is at most 


1 — u/X 

1 - ' 


(8.58) 
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Likewise, the probability that k + a]\f/8Tk]\f < pj and at least one type j individual descended 
from this mutation is alive at time k + ajsf/STkjsf is less than or equal to 


\ — j/g—(A—!^)(ajv/8Tfejv) ■ \ ■ J 

We must show that the expressions in ()8.58p and (18.591) are bounded above by 3skf^ for sufficiently 
large N. We have 

1 — i>/X<X — u<{e + 36)sk]\f. 

Because e + 35 < 3 by ()3.ip . it remains only to show that the denominators of the expres¬ 
sions in ()8.58p and (|8.59p tend to one as —>■ oo. If iV is large enough that i' < 1, then 

we have < (1 -|- (e -|- 26)sk]\[)~^^^>^'>^ , which tends to zero as —>■ oo because 

{skN){s/p.y~^ —>• oo as A^ —)• oo by (jl.8l) . Likewise, ^ q as N ^ oo because 

(A — u)a]\[/8TkN > (e -|- 36)saN/8T —^ oo as A^ —)• oo. The result follows. □ 

Lemmas 18.181 [8T^ and 18.201 below give us the bounds that we will need to establish that the 
result of part 1 of Proposition 13.31 holds with high probability. We will use the notation 
for a collection of probabilities pj^N such that 


lim k]\f sup pj^N = 0. 

N^oo 

Lemma 18.181 shows that it is highly unlikely that any type j mutations appearing before time 
Tj will have descendants alive in the population after time rt. As a result, it will be possible 
essentially to ignore such mutations. 


Lemma 8.18. We have 

for some t G [^j-i,T* A pj^ = (8.60) 

and 

> 0 for some t G [tj , Pj]) = o{kfy). (8.61) 

Proof. Write pj = tj A pj. Suppose first that j > k* + 2. Because pj < ftie result of part 

2 of Proposition 13.31 holds for type j — I individuals up to time pj, which means 


rpj 

Jij-i 


pXj_ip{t) dt < p{l + 


fpj 

45) / 
Jij-i 


If. Gj-i{v) dv 


dt. 


Also, since pj < Ci,i-i 5 Lemma l6 .1 1 implies that 


jff-i Gj-iiv) dv 


2s 

h 


for sufficiently large N, which leads to 




e-fy Gj-i{v)dv 


(8.62) 
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Now suppose instead that j = k* + 1, and recall that = t* by definition. Then because 
Pj < Cij-i) the result of part 1 of Proposition 13.21 gives 

r dt < p{l + 6) 

Jij-1 ?j-i 

Reasoning as in the proof of Lemma l6.II but using (I3.12jl . we get that for sufficiently large N^ 

so (|8.62p holds in this case as well. Therefore, combining (18.621) with part 2 of Lemma 18.81 and 
writing Cio = 2(1 + 4<5)/(l — 26), we get 


r pXj_i 2 {t) dt < 2s{l + 45) g-(l-25)sfciv(p,-b 

4-1 ’ Jij-1 

Because pj < Cij-i; the last statement of part 1 of Proposition 13.31 implies that no early 
type J — 1 individual acquires a jth mutation before time pj. Because each type j — 1 individual 
acquires mutations at rate p, the number of times that type j — 1 individuals that are not early 
acquire a jth mutation between the times and 

inf ju : J pXj_i^ 2 it) dt > 

is Poisson with mean Cio/kjsf. In particular, the probability that at least one such mutation 
occurs during this time period is at most Cio/fejv, and the probability that two or more such 
mutations occur during this time period is at most C^q/A;^. If such a mutation occurs before time 
Pj, then by Lemma [8.171 the probability that the number of type j descendants of this mutation 
exceeds {s/p)^~^ before time pj is at most Ssk^- Likewise, the probability that some type j 
descendant of this individual is still alive at time r* A pj is at most 3sk]\f. Thus, the probabilities 
of the events in (j8.6Up and (I8.6ip are both bounded above by 


4o ^ Clio , , 


This expression is o{kj^^) because sk^ —)• 0 as N" ^ oo by assumption A3. □ 

Lemma [8.191 bounds the probability that, when j ^ 0, we have an early type j mutation with 
descendants alive after time Tj . This bound is given in (18.641) below. A sharper bound is given 
in (j8.63l) for the probability that such a mutation occurs before time . 

Lemma 8.19. For sufficiently large N, we have 


P{{X^p’^^ \t) > 0 for some t E [rj,pj]} n {j ^ 0}) < (8.63) 

and 

r J- 1 1 

P{{Xj ’^^\t) > 0 for some t E [T*,pj]] n {j ^ 0}) < ——. (8.64) 

Kn 
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Proof. By Lemma 18.101 and part 1 of Lemma 18.81 on the event {j ^ 0}, we have 



/rXj_i(t) dt < 


< 



ff Gj-i(v) dv 


dt 


f^s{qj+C 9 )it-Tj) 


< (1 + 35)s • 


^s(qj+C9){i. -T-j) 

s{qj + 6*9) 


(8.65) 


By part 3 of Lemma [8.81 we have (1 + 35)/{qj + Cg) < 2/kfq for sufficiently large N. Also, 
recalling (18.391) and observing that log(l/sgj)/gj —)• 0 as ^ oo on {vj < pj} by assumption A1 
and part 3 of Lemma IHTSl we get that for sufficiently large N, on {tj < pj}, 


^s(qj+C9)(^j -Tj) 





C^\ ^ ^ 

Qj J ~ sqj ' 


( 8 . 66 ) 


Therefore, on the event {j ^ 0}, we have 



pXj-i{t) dt < 


4e-^ 
skNqj' 


(8.67) 


Likewise, if we replace by f,j in (|8.65p . (|8.66p . and (|8.67p . we get that on the event {j ^ 0}, 



( 8 . 68 ) 


Let Li be the number of type j mutations between times Tj and /\pj, and let r 2 be the num¬ 
ber of type j mutations between times tj and A pj. Because each type j — I individual acquires 
mutations at rate p, equations (I8.67P and (|8.68l) imply that /{skiqqj) and 

E\r 2 ^{j^Q}\d^Tj\ < /{skp^qj). Let Aj be the event that rj < pj and the individual that gets 

the ith type j mutation between times Tj and f,j has type j descendants alive at time Tj. By 
Lemma [8.9l this individual must have type j descendants alive for at least a time /KPkjq after 
the time of the mutation. Therefore, by Lemma 18.171 we have P(Aj|r > i) < Sskjsr. Using part 
3 of Lemma ISTHl equation ()3.1I) . and the fact that J/k^ < 4T for sufficiently large N, we get 


ri 


p({j^e}u[jA, 


2 = 1 


1 12e"® 13e-^ e 832Te 

TVj ) < 3sA:7vLl[ril{j^0j|Jv^.] < - < —- < 


-b 


Qj kN 

Equation (I8.63|) follows because e~^ < e/332T by (I3.14|) . Likewise, 

^2 \ -\ n b 


16J 


p({j ^ 0} U [J Aj < 3skNE[r2t{j^ej\X, 

^ i=l ' 

which implies (I8.64h . 




126® 13e® 

< 


qj kN 


□ 
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Lemma 8.20. For sufficiently large N, on the event {pj > Tj}, we have 


^ for some t G [tj, r^+i A Pj]\iFrj) < 


If Gjiv) dv 


97 


and 


Also, 


P{xf^ ’^'\t) > C'se-'A for some t G [Tj,Tj+i A pj]\Prj) < 


If Gjiv) dv 


17 J 


p(^Xp’^^\t) > for some t G [Tj,T* A pj]^ = o{k^^). 

Furthermore, 118.69\) holds even if j is random, as long as Tj is a stopping time. 
Proof. Let pj = Tj+i A pj. Using the notation of Corollary 14.41 if t > tj, then 


(8.69) 

(8.70) 

(8.71) 


e~ ^ (v) dv [Tj ,^j 


’'^i\tApj)= j pXj_i{u)e du +Z^p'^^\tApj). (8.72) 

J Tn 


By Corollary 14.41 the process + t), t > 0) is a martingale. Therefore, if we define 




Y{t) = e~ dv^[rj,ij] ^ 


for all t > Tj, then the process {Y{t),t > Tj), haying been expressed in (|8.72n as the sum of an 
increasing process and a martingale, is a submartingale. By Doob’s Maximal Inequality, 

P{Y{t) > Cs for some t G [r,-,p,]|JV,) < ^E[Y{pj)\P^f. (8.73) 


By ()8.72p and Lemma [ 8 .101 

E[Y{pj)\F^f=E 


rpjhij 
J Tn 


pXj-i{u)e ’’a 


C 3 

- If Gjiv) dv 


du 


T 


< (1 + 35)sE 


<1 + 3(5. 


r pj A+ 


du 


Tv. 


(8.74) 

Now ()8.69l) follows immediately from (|8.73l) and (I8.74p . as long as C 3 > 97(1 + 35)je, which is 
true by ^8.541) . Note that Remark 14.51 implies that p8.69p holds when j is random, proyided that 
Tj is a stopping time. 

To obtain (I8.7ip . note that if f < r* A pj, then by part 2 of Lemma ISTSl we haye 

-(e+25)/4T 

f I *rmt XZ/ii^C-Nj \ ! n \ 

> e“ 


-IrTjiv)dv ^ ^_(ajv/ 4 rfcjv)((e+ 2 < 5 )sfcjv) = f £. 




Therefore, 


p{ x^;^'^^\t) > 


h 


l-(5 


for some t G [Tj,Tj A pj]^ 


'(yit) 


= P{ Y(t) > e 


-ffGj(v)dv f S 


1-5 


<p(Y{t) 


s 


F 

l-5-(e+25)/4r 


for some t G [TjTj Pj] 


for some t G [Tj,T* A pj] 
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Write 9 = 1 — 8 — {e + 28)/4T, which is positive by (13.111 . Arguing as in the derivations of (I8.73P 
and (|8.74ll but using (s//u)^ in place of C 3 and r* A pj in place of pj, we get 


^ for some t et e [Tj,T* A pj]^ < (1 + 

The result (j8.7ip follows because {s/p)~^kN —)• 0 as A" —>■ 00 , as can be seen by taking the 
logarithm and using (HZl). 

The argument for (I8.70p is similar to the argument for (I8.69p . Again using Corollary 14.41 we 
have 


e 


^3 


1 r 

(t Apj) =J 

^3 

where +1), t > 0) is a martingale. For t > , let 


tA^jApj _ r“ Gj(v) dv ff- c.l 

pXj-i{u)e du + ZP ’ ^ (t A Pj), (8.75) 






W{t) = e b' 




By ()8.75p . the process {W{P +t),t > 0) is a submartingale. By Doob’s Maximal Inequality, 


p(g-T/ Gj{v) ^ ^ ^ [^~,Pj]\X.-) < - -^ 7 ^- E[W{pj)\P.-]. (8.76) 

■' 03 

By (|8.75p and Lemma 18. 101 

E[W{p,)\T^^] = E 

^7 


g-/r/ Gj(v)dv 


< (1 + 38)sE 


CPjA^j _ Gj{v) dv 

pXj-i{u)e b du 


^7 


rpjhij rGi-i(v)dv -!XGj{v)dv 

^ ’ e du 


14?: 
«7 


^3 


rPjAij 

/ du 

- 


h~ 

^3 

^3 


= (1 + 38)se^-^ ^^E 

< (1 + 38)s{^j - i~)e^'^i 

^ 2(1+ 3.5)6 G,{v) dv 

^3 

Since qj > (1 — 28)/k^ on {tj < pj} for sufficiently large N by part 3 of Lemma ISTSl it follows 
that for sufficiently large N, we have, on {tj < pj}, 

ztTti//'- M 77 1 ^ [t^ Gj{v) dv 

E[W{pj)\E^-]< — -eJ^^ ' • 

Therefore, recalling (18.5411 and noting that J < 4TA:Ar for sufficiently large N, we get 

P(e“^A ^^■(^)'^’'lT(f) > C3 for some t G [Tj,pj]\E^-) < 

Taking conditional expectations of both sides with respect to TV, yields (I8.70p . □ 
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We now combine Lemmas 18.18118.191 and 18.201 to establish that the result of part 1 of Propo¬ 
sition [331 holds with high probability. In view of the fact that 2{s/^Y~^ < s/2ijl for sufficiently 
large N, Proposition 18.211 establishes the hrst two statements of this result. Proposition 18.221 
establishes the last statement. 

Proposition 8.21. For sufficiently large N, 

^ P (t) > for some t £ [t *, Tj+i A pj ]} 

j=k*+l ^ 

U some t < t* A 

Proof. Recall from the discussion before (I8.55P that if a type j individual appears before time 
then pj occurs at that time, so we only need to consider type j mutations after time fj-i- 
Combining (I8.60D and (18.711) . we see that for sufficiently large N, 

P{Xj^i{t) > 2{s/ for some t < t* A pj) = o{kfl). (8.77) 

By (I835I1 . (I8RTI1 . (18331) . and (IHTni) . 

X] for some t G [r*, r^+i A pf] n {j ^ 0}) < | (8.78) 

j=k*+l 

for sufficiently large N. Because we observed that there can be at most 12 values of j for which 
Tj < Pj and j G 0, it follows from (I8.6ip and (I8.69p that for sufficiently large N, 

P{{Xj^i{t) > Cse-^A for some t G [r*, Tj+i A pj]} n {j G 0}) < |. (8.79) 

j=k*+i 

Note that the values of j that are in 0 are random, so we are using the statement in Lemma (8.201 
that (|8.69l) holds when j is random, as long as tj is a stopping time. □ 

Proposition 8.22. Let pj = Tj^iApj. Let Aj be the event that an early type j individual acquires 
a {j + l)st mutation at or before time pj. Let 

El = [Xj^i{t) < 2{s/p)^~^ for all t < t* A pj and j G {k* -|- 1,..., J}} 

E 2 = {Xj^i{t) < C'ae'^A' for all t G [r*, pj] and j G {k* -|- 1,..., J}} 

E3 = {(1 - 4 S)J‘l ^ < Xj^t) < (1 + 

for all t G [T*,Pj] and j G {k* -|- 1,..., J}}. 


Then, for sufficiently large N, 

^(( U 0311132033) 

^^j=k*+l ' ' 
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Proof. We first bound the probability that an early type j individual gets a (j + l)st mutation 
between times (,j-i and r*. When Ei occurs, we have, using ()3.26p . 


dt < 2^1 -] {t* a pj - Cj_i) 

j=A:*+l j=k*+l 


< 2p{J-k*) 




kN 



0 


(8.80) 


as —)• oo. Because each type j individual acquires mutations at rate p, the expression on the 
right-hand side of (|8.80p bounds the probability that Ei occurs and, for some j G {k* -|-1,..., J}, 
an early type j individual gets another mutation between times and t*. 

Consider next the possibility that such a mutation occurs between times r* and Tj+i A pj. In 
view of (j8.64p and the fact that there are at most 12 values of j for which tj < pj and j G 0, the 

probability that there are fewer than values of j for which Xj^i{t) > 0 for some t G [Tj,Pj] 

1 /2 

tends to one as N ^ oo. Suppose there are indeed fewer than such values of j, and suppose 
E 2 and E^ occur. Then, for sufficiently large N, 


jff Gj(v) dv ^ Xj^2iPj) ^ l + s/p ^ 2s 

~ 1 — 4(5 “ 1 — 4(5 ~ p 

Therefore, using part 2 of Lemma l8.8p 


Y [ l^^j,lit) < Ciik]i^P [ 

1.* , 1 J T* J T.* 


/*. Gj{v) dv 


dt 


j=k*+l i 


= CspiPel’l r e-JP Cj(") *■ dt 


<C3kl'P- 


2s fPi 


-{l-2S)skff{pj-t) 


< 


2 C 3 


(1 - 26)k]{^ 


(8.81) 


as N ^ 00 . The expression on the right-hand side of (18.810 bounds the probability that for some 
j G {k* -|- 1,..., J}, an early type j individual gets another mutation between times r* and pj. 
Equations (I8.80p and (|8.81l) thus imply that the probability that Ei, E 2 , and E^ occur but Aj 
also occurs for some j G {k* -|- 1,..., J} tends to zero as —)• 00 . The result follows. □ 


8.4 Type j individuals between times and 7 j+ic 

In this subsection, we show that the number of type j individuals behaves quite predictably 
between times rj+i and Jj+K- In particular, we show that the result of part 3 of Proposition 13.31 
holds with high probability. The key to the argument will be showing that the fluctuations in 
Xj{t) are small. We assume throughout the subsection that j G {k* -|- 1,..., J}. Let 

P'j = Pj ^j+K- 
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We apply Corollary 14.31 with tj+i in place of k and p', in place of r to get that for t > tj+i, 


XjitApj) = f 


+ (8.82) 


' r;+i 


To lighten notation, we will set 




and then the process {Z'-{Tj+i + t),t > 0) is a mean zero martingale. By definition, we have 
s/p < Xj^Tj^i) < 1 + s/p, so the first term in (j8.82p is very close to the expression in ()3.20l) . 
Therefore, to show that (I3.20p holds with high probability, we need to show that the second and 
third terms in (|8.82l) are small relative to the first term with high probability. We begin with a 
result similar to Lemma l8. lUI that holds between times 'jj-i+K and "Jj+K- 

Lemma 8.23. For sufficiently large N, i/ 7 j_i+x <t< p'-, then 


Xj-i{t) < 


(1 + 26')k'^S f/ Gj—i{v) dv 


e V 




Proof. If j > k* + 2, the result is immediate from (j3.2ip . Suppose instead j = k* 1. Then 
(|8.4ip holds. Because (1 + s/p)/[{l — 6){s/p)] <1 + 26 for sufficiently large N, an application of 
(j3.13p then gives the result. □ 


The next lemma controls the second term in (18. 821) . We will consider the event 

Fj = {X,-2(r,+i) > (1 - 45)6^’^'^' 


(8.83) 


By Corollary 18.151 with probability at least 1 — e/25, for all j E {k* + 1,..., J} either Fj occurs 
or Tj+i > pj. 

Lemma 8.24. For sufficiently large N, iftG [rj+i,/?(] and Fj occurs, then 


pXj-i{u)e 




' "E+i 


6s 

du < —. 

6p 


Proof. First suppose Tj+i < u < p' A Then by Lemma ETHI 

, , - r Gi(v)dv , ,, f'‘Gi-i(v)dv -r GPv)dv 

pXj_i{u)e •'a+i ’ < (1 + 35)se-'"^' ' e •'a+i ’ 

= (1 + 36)se^^T' 


On the event Fj, we have 


so for sufficiently large N, on Fj, 


d.v ^ 1 + s/p 


e-’ o 


1 -4(5 ’ 


- . — Gj{v)dv 2s^ r'l 

pXj_i{u)e ■'a+1 ’ < -e 

F 


(8.84) 


(8.85) 
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Next, suppose 'jj-i+x <u<p'y Then by Lemma [8.231 and (|8.84|] . for sufficiently large N, on Fj 
we have 

, , - r GJv)dv _ rG.i-iiv)dv -r GAv)dv 

pXj_i{u)e ■'"^■+1 ’ <{l + 25)k%se-^^i ' e ’ 

-0 + 1 , 


= (1 + 28)k%sJ^i 


0^2 2 

< ^-s(u-Ti) 

k- 


( 8 . 86 ) 


By (I8.85|) and (I 8 . 86 p . if t € [rj+i,p() then on Fj, 


pXj_i{u)e 


Gj{v)dv 


du < 


' rj'+i 


2^2 HA'Tj-i+K 

k Jr, + o 


e-<u-r^) du + 


2 kls^ 

k 




’ij-l+K 


2 s 




(8.87) 


If t G [rj+i,p'), then r^+i < pj, which means Tj+i — tj > a]\f/3kN by ()3.26p and 'Jj-i+k — tj > 
'fj — Tj = Oat. Therefore, by ()1.7p . we have 


s(rj+i -Tj)> 


aNS log(s//r)" 


3k]\[ 31ogAI 


and 




kj\fe 


oo as iV —>■ oo 


0 as N ^ oo. 


The lemma follows from p8.87p . p 8 . 88 p . and (18.891) . 


( 8 . 88 ) 

(8.89) 

□ 


It remains to bound the third term on the right-hand side of p8.82p . To bound this term, we 
will need to control the fluctuations of the martingale (Z7(rj+i + t),t > 0). Lemma 18.271 below 
gives the required second moment bound. Before stating this lemma, we provide some estimates 
on Gj{v) in the following two lemmas. 

Lemma 8.25. For sufficiently large N, if j > k* + 1 + K and u G [Tj^i,'fj-K A p'j), or if 
k* + l<j<k* + K and u G [tj+i, on A p'j), then 


dv ^ g-sfcjv(u-rj+i)/5 


(8.90) 


Proof. We will use the results of Proposition 13.51 which by definition hold up to time p'j. Also, 
recall that K = [/sAr/dJ. First, suppose j >k* + 1 + K and t < p'j. If t < a^i, then by part 1 of 
Proposition 13.51 for sufficiently large N, 


Gj{t) = s{j - M{t)) - p > s{j - 3) - p > 


skj\f 


jGo -w — V-/. . ^-5 

If t G (uAf, 7fc*-i-i), then by part 2 of Proposition 13.51 for sufficiently large N, 
Gj{t) = s{j - M{t)) - p> s{j - /cat - C' 4 ) - /i > 

o 

If t G ['fk*+i,'fj-K), then by part 3 of Proposition 13.51 for sufficiently large N, 
Gj{t) = s{j - M{t)) - p> s{j - {j - K -1) - 2 G 5 ) - p> 


(8.91) 


(8.92) 


(8.93) 
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Combining (18.911) . (18.921) . and (|8.93p . we get for u G [rj+i,A p') 

[ Gj{v)dv 

JTj + l 0 

which leads to (jS.hOj) . Next, suppose k* + l<j<k* + K. If t < oat A p'-, then (j8.9ip holds as 
before, which again yields (18.901) . □ 

Lemma 8.26. For sufficiently large N, if j > k* + 1 + K and u G [jj- K,p'j), orifk* + l<j< 
k* + K and u G [a 7 v,Pj); then 


— /“ Gj (v) dv 
e A+i ' 




-A;jv/241 


Proof. Recall the definition of M{t) from (I7.16p . Write 7 * = 7 fc*+i if /c* + 1 < j < fc* + ii' and 
7 * = 'jj-K if j > k* + 1 + K. Also, write i = k* + 1 if k* + 1 < j < k* + K and i = j — K 
if j > k* + 1 + K. Suppose u G [ 7 *,p'), and note from the definition of p'- that this means 
u < 'Jj+K- Now 

ru pu 

/ Gj{v) dv = / [s{j — M{v)) + s{M{v) — M{v)) — p) dv 

J 'y* J 'y* 


^ r-yi+iAu 


E 


(s(j — i) + s{M{v) — M{v)) — p) dv. 


(8.94) 


i=i 

Because positive terms can be bounded below by zero, we have, using (|3.26p . 


^ rit+iAu 


j+K-1 


E 


£=i 


s{j-£)dv> ^ s{j - £){ye+i/\u - ye Au) 


e=j+i 

zajsf , . 

-17 ^ 


^=j+i 

K(K — l)soAr 
kN 

Using p3.26p and p7.17p and the fact that there are at most 2K terms in the sum, we get 


(8.95) 


j+K 1 


l=i 


l')tAu 


{^s{M (u) — M{v)) — p) dv > —2K 2 G 5 -\—• 




(8.96) 


Now since s/p—^-ocasA"—)-ooby (jl.8l) and saN/kiy ^ 00 as A ^ 00 by pi.7p . we have 
4 G 5 + 4ajv/u/kjv < sa]\f/kN for sufficiently large A. Combining this observation with (|8.94|) . 
(I8.95p . and (|8.96l) yields 


Gj (u) dv > — 


K‘^saj\f kj^saj^ kj^ 


kN 


> - 


=-log 

16 16 ^ 


0 


(8.97) 
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It remains to consider the integral between times Tj+i and 7 *. Suppose first that j > k*+l+K. 
In view of part 3 of Proposition 13.61 for sufficiently large N, as long as 7 * < p'j, we have 

* , ^ ^ ^ ^ 

7 — Tj'+I = Ij-K — Tj-K + Tj-K — Tj+l = Utv — [Tj+1 “ Tj-K) > CLN - 

Thus, assuming that 7 * < p'j, Lemma 18.251 implies that for sufficiently large N, 

Suppose next that k* + l<j<k*+K. Then, as long as a^v < p'j, parts 1 and 3 of Proposition 
13.61 imply that for sufficiently large N, 


aN — Tj+l > aN — Tk*+1 + Tk*+1 — Tj+1 > Oat 


2a7v 

kN 


2KaN ^ UAf 
k]\[ ~ 3 ’ 


and the same reasoning that yields (j8.98p gives 


f Gj{v)dv>^ log 

JTj + l -'-3 



(8.99) 


Now suppose u E [aAr,7A:*+i A p' ). Then for v E {aN,u), since k* + 1 > k~l^ > kj\f and M{v) < 
kjy + C 4 by the result of part 2 of Proposition 13.51 we have 


Gj{y) = s(j — M{v)) — p> s{k* + 1 — kN — G 4 ) — p> —sG^ — p. 


Therefore, since ( 7 fc*+i A p' ) — aN < 2aN/kN by part 1 of Proposition 13.61 for sufficiently large 
N we have 


Gj{v) dv > —{sGi + p){u — qn) > 


' a,N 


By combining (j8.97p and (j8.98p when 
(I8.97p . (|8.99p . and (|8.100p when A;* + 1 
large N in both cases. 


2 {sG 4 + p)aN ^ 2 (C 4 + p/s) /£\ ^ /£\ 

kN kN ^ \pj ~ ^ \dj 

( 8 . 100 ) 

j > k* + 1 + K and u E [ 7 j_A',p'), and by combining 
< j <k* + K and u E [otv, p'j)-, we obtain for sufficiently 





>—log 
- 241 ^ 


( 7 )- 


The result of the lemma follows. 


□ 


Lemma 8.27. For sufficiently large N, we have, for all t > 0, 

Var(Z;(Tj+i +t)|j;,^J < ^ 


on the event Fj. 
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Proof. By Corollary 14.31 and (|5.5I) . for all t>0, 
Var(Z"(T,+i <E^ 


m+^ 

J TjJ^l 


'^j + 1 


{nXj-i{u) + 3Xj(u))l{„<p/} du 


T 

■^Tj + l 


( 8 . 101 ) 

Using ()8.85l) when u < and using (| 8 . 86 l) combined with ()8.89p when u > 'jj-i+K, we 

obtain that if ri G [tj+i,/?'■), then for sufficiently large N, 


Gj{v)dv „ ,2s 


e ^ 3+1 






( 8 . 102 ) 


Also, by ()8.82p and Lemma [8.241 if n G [Tj+i,p'), then on the event Fj, for sufficiently large N, 

'^Xj{u) = Xj{Tj+i) + f fiXj_i{w)e '^'^^+1 drc + Z"(n) 

Tj+i 


— ff Gj (v) dv , 
e G+i ' 


^ s 5s , , 
< 1 H h -—\- Zj {u). 


(8.103) 


Combining (|8.101l) . (I8.102p . and (I8.103p . and noting that 2s^ j + 3(1 + s//x + 5sl‘i[i) < 4s/;U for 
sufficiently large A^, we get 


Var(Z''(r,+i+t)|J-.^.^,)<L; 


/’ 


+ — r“ Gj(v)dv f 4S r:,/// \ 1 

e ■^"■^■+1 ( — + (u) ) l{u<p'.} du 

L^Tj + l V " 






Tj'+i 


(8.104) 


on Fj for sufficiently large N. 

To bound the right-hand side of (|8.104p . we split the integral into two pieces. Let 7 ' = 'Jj-k 
ii j > k* + 1 + K, and let 7 ' = oat if fc* -|- 1 < j < A;* -|- AT. Consider first the contribution to the 
integral from u < 7 '. Because the integrand in p8.10ip is nonnegative and Z”{u) = Z”{p'-) for all 
u> p'j, we have As/ p + 2>Z"{u) > 0 for all u > Tj+i. Then by Lemma [8.251 for all u > 0, 


d-'’’ ( 4s 


G'+i 


-^ ] ^{«<7'ApU < 6 

p ’ 


-skj^{u-T^+,)/5 ^ 3Z''(u) 


Combining this observation with Fubini’s Theorem and the fact that {Z"{Tj+i + t),t > 0) is a 
mean zero martingale, we get for all t > 0 , 


F 


Tj + l+t _p GPv)dv/'As 


e ""j+i 


LJTj + l 


T (^) ) ^{n<7'ApU du 

A 




"G+i 


< F 


f 


L Xj^i 

As 


A 
20 

pkN' 


poo 

J Tj^l 


^-skM{u-rj+i)/5^^ ^ 3Z"(u) ) du 

-sk]^ («-r,+i)/5 


F 

J n 


Tj + 1 


(8.105) 


Likewise, by Lemma 18.261 


— r“ Gj{v) dv f As 

h 


) / \ —/cai/241 / , 


(8.106) 







































Also, using (13.261) . which is valid up to time p' 


I , 2Ka]\i Sa^v 

Pj — Tj+l = UN + Pj — 'Yj+i < aN 


(8.107) 


Combining (|8.106l) and (j8.107p with with Fubini’s Theorem and the fact that {Z'J{Tj+i + t),t > 0) 
is a mean zero martingale, we get for all t > 0 , 


E 


f 

J Ti 


Tj+i+t _p GiMdv/As 

^ 'JTa\a J \ y i 


e ""j+i 


L J + l 


— + 3Zj (u) ] l{ 7 '<«<p'} du 




Tj + l 


< E 


L Tj + l 


Tj+i+3ajv/2 / „ \ —fcjv/241 / a 

' ^ (- + 3Z"{u)]du 


P 


P 


T 

J n 




^ 6 sojv / s 
~ P \P 


6 /s 
P\P 


-fcjv/241 

-fcjv/241 

log 


(:)■ 


(8.108) 


Because k]^{s/p) \og{s/p) —)• 0 as iV ^ oo, as can be easily seen by taking logarithms, 

the lemma follows from (I8.104p . (I8.105p . and (I8.108p . □ 

Lemma 8.28. For sufficiently large N, 


p(J^\Zjit)\ > ^ for some t G [r,+i,p']| n Fj^ < 


756/r 


6 ‘^s‘^kN 

Proof. By the Maximum Inequality for martingales and Lemma 18.271 on the event Fj, 

5s 


p( sup|Z"(r,+i + t)| > 

V t>0 3p 


T 


^ ^ 36^2 750^ 

) S ^ Var(Z, (r,+i + t)Vx,„) < -gT^- 


Taking expectations of both sides yields the result. 

Corollary 8.29. For sufficiently large N, 

'' '' (1 - 5)s ^f*^^^^ Gj(v} dv (1 + Gjjv) dv 


□ 


j=k*+l 


P 


P 


for some t G [rj+i,p'] ) < 


Proof. By p8.82p . Lemmas 18.241 and 18.281 and the fact that s/p < Alj(rj+i) < {s/p){l + 5/3) for 
sufficiently large N by (II. 8 p . we have 


j=k*+i 


iUiEgA+i 


p 


p 


for some t G [rj+i,p(] 


S E + {«>-.«)) 


j=k* + l 


Because Ylj=k*+i d^i^^j^iPj — T/'+ii) — ^/25 by Corollarv l 8 .isl and ,Jp/{5"^s’^kj^s) 0 as N ^ oo 
by (jl. 8 p . the result follows. □ 
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8.5 Type j individuals after time 'Jj+k 

In this subsection, we show that the number of type j individuals decreases rapidly after time 
"fj+K- More specifically, we show that the results of parts 4 and 5 of Proposition 13.31 hold with 
high probability. We will consider the event 




I J 


By Corollary 18.291 when t = Jj+K, with probability at least 1 — e/24, for all j E {k* + 1,..., J} 
either Hj occurs or 'Jj+k > Pj■ Recall also the definition of the event Fj from (I8.83|] . 

Lemma 8.30. Suppose j E {k* + 1,..., J}. For sufficiently large N, if i < j — 1, then 


^eiu+K) 

^ji^j+K) 



(8.109) 


on the event Fj n Hj n {'Jj+K < Pj}- 


Proof. We will assume throughout the proof that 'Jj+K < Pj- By Lemma [8.231 if k* < i < j — 1 
and 'yj+K < Pj-, then 


X(,{'yj+K) < 


(1 + 25)k%s jfi+f Gt{v) dv 

P 


( 8 . 110 ) 


Therefore, on the event Hj, 


^ein+K) ^ (1 + 26)k% jff+f Gejv) dv fffff(Ge(v)-Gi(v)) dv 
^j(7j+K) ~ 


( 8 . 111 ) 


Recall that on the event Fj, equation (I8.84p holds, and therefore Gj{v)dv < for suffi¬ 

ciently large N. Also, by Lemma l 6 .11 in view of the assumption that 'jj+K < Pj-, the same result 
holds when j is replaced by /i E {k* -|- 1,..., j — 1}. Therefore, for sufficiently large N, we have 


(.J-re+l 


G({v) dv 


h=£+l 


< 



( 8 . 112 ) 


Also, it follows from (j3.26l) that 'jj+K—Tj+i = ON+Tj-^-K—Tj+i > a]sf+aNiK—l)/3kN > 14aAr/13 
for sufficiently large N. Therefore, 

f \ ^ J N/ 14(j-£)saAr 14(j-f) f s\ 

J {Gi{v) - Gj{v)) dv = -s{j - i){-fj+K - Tj+i) < - — -=- — -log ( - 1. 

(8.113) 

By (I8.11ip . (j8. 11211 . and (j8.113h . for sufficiently large N, on the event Fj n Hj n {jj+K < Pj}, we 

X , , (i + 2i)fc^ / 

^jilj+K) l-S V \p) J 

Because 2{s /0 as N ^ oo, for sufficiently large N this expression is largest when 
£ = j — 1, and thus (|8.109p holds whenever k* < i < j — l- 
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Next, suppose 0 < ^ < A;* — 1. Then by (13.121) . we have 

Xe{-ik*+K) > (1 - 

and by (|8.26l) . we have ^£(7A:*+_ft:) < (7fc*+i^)- Therefore, by (13.131) . 

Xi{-ij+K) < k%Xe{t*)e-^t*''^*’' 




r^j+K 


^ V { \ ^k* (^) 

< Y^Xk*{-fk*+K)e-’^^*+^ ^ ’ . 


Now using Lemma 18.101 




which is the same as (j8.11UI) when i = k* except for the constant in front involving 6 . Therefore, 
(18.1091) holds on Fj n Hj n {'Jj+K < Pj} in this case as well. □ 

Proposition 8.31. For sufficiently large N, 

j 


y~l (^) > for some t G {jj+K, Pj] 

U {Xj{t) > 0 for some t G [-fj+L,Pj]}^ < (8.114) 


j=k*+l 


e 

12 ' 


Proof. Suppose j G {A:* + 1,..., J}. Recall that Sj{t) = Xoit) + Xi(t) + • • • + Xj(t) for all t > 0. 
By Proposition 14.61 and Remark 14.71 the process 

r(7j + x+‘)Apj „ , s , 

“■‘”>''”S,((7y+A'+i)A«),t>0) 


is a nonnegative siipermartingale. Therefore, 

— r* Gj (v) dv 


P ( sup e 

' ^^[7i+K )Pj] 


Sj{t) > ^Sji-fj+K) 




(8.115) 


Since j < J < 4rA:Ar for sufficiently large N, on the event Fj n Hj n {'^j+K < Pj} £ 
Lemma 18.301 implies that for sufficiently large N, 


Since k‘^{s /—)• 0 as —)• oo, as can be seen by taking the logarithm and applying 

for sufficiently large N we have Sjf^j+x) < (3/2)Xj(7j_|_x) on Fj nLAj(n{7j+x < Pj}- Combining 
this observation with ()8.115[) gives that for sufficiently large N, 


p{s,{t) > 




F^j+K 


Xj(7j+k) for some t G {-fj+K,Pj] 


- k% 
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on Fj n Hj. Since Xj{t) < Sj{t) for alH > 0 and (3/4)Xj('yj+x) < on Hj, it 

(8.116) 


follows that on Fj n Hj, 


^1 / X k%s r* Gi{v)dv ^ 

P( Xj{t) > -e'^ 3+1 for some t G {'yj+K,Pj] 




- kl 


Also, on the complement of the event in (I8.115h . if pj > jj+L then 






Reasoning exactly as in (18.291) . (|8.30p . (I8.3ip . and p8.32p but with j in place of k*, we get that 
on the complement of the event in (|8.115p . if pj > Jj+L then for sufficiently large N, 

-16fciv/15 


Sjijj+l) < 


Nk% 


As in the discussion following (I8.32p , we see that the right-hand side tends to zero as N ^ oo and 
thus must be less than one if N is large enough. Because Sj{'yj+L) is an integer, it follows that 
Sj{'^j+L) = 0, and therefore that Xj{t) = Sj[t) = 0 for all t > jj+L- Combining this observation 
with (|8.116p . we get that the sum of the probabilities in (|8.114p is bounded above by 


J 

E 

j=k*+l 




+ P{F,UH,) 


By Corollaries 18.151 and 18.291 this expression is at most 2J/A;^ -|- e/25 -|- e/24, which is less than 
e/12 for sufficiently large N. □ 

We now combine the results of this section to complete the proof of Proposition 13.81 

Proof of part 3 of Proposition \3.^ It follows from Corollary 18.151 Proposition 18.211 Proposition 
18.221 Corollary 18.291 and Proposition 18.311 that 


y P({Co = oo} n {Cl 1 < Pi]) <- + - + — + — + — 

VLSU J LSij - 25 4 4^ 94 19 


48 24 


j=k*+l 

for sufficiently large N. Also, Remark 18.71 gives that for sufficiently large N, 

J 

y P({Co = oo} n {Ci,j < pj}) < 


e 

12 


j=0 


Because e/25 -|- e/4 -|- e/48 -|- e/24 -|- e/12 -|- e/16 < e/2, it follows that (18.11) holds for sufficiently 
large N. As noted at the beginning of section [SI this completes the proof of part 3 of Proposition 

□ 


9 Proof of Theorems II.ILII.2L and 11.41 

With Proposition 13.81 having been established, in this section we use this result to prove Theorems 
iniira and O All of these theorems follow rather directly from Propositions 13.2113.31 [3Al and 
13.61 which, as noted in section[3l all follow from Proposition [ffiSJ We prove Theorem 1 1.1 1 in section 
19.11 Theorem O in section 19.21 and Theorem 11.41 in section 19.31 
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9.1 The selective advantage of the fittest individuals 


Recall that Q{t), defined in (I1.12|) . is the difference between the number of mutations carried by 
the fittest individual and the mean number of mutations in the population. Consequently, it is a 
measure of the selective advantage that the fittest individuals in the population have over typical 
individuals in the population. Theorem 11.11 describes the asymptotic behavior of the process 
> 0) as the population size tends to infinity. 


Proof of Theorem \l.l[ It suffices to prove (jl.l4l) for S = [u,v\, where either 0<u<u<lor 
l<n<u<oo. In view of part 2 of Proposition 13.61 it suffices to show that 


\Q{aNt) - R{aNt)\ 

sup-;- 

tes kn 


P 0 , 


(9.1) 


where R{t) was defined in (j3.23l) . Throughout the proof, we fix e > 0, 5 > 0, and T > max{l, u}. 

We assume that N is large enough that the conclusions of Propositions 13.21 13.31 13.51 and 
M\ hold with probability at least 1 — e, and we work on the event that the conclusions of these 
propositions hold. Suppose first that 0 < n < u < 1. By Proposition 13.51 we have 


supM(a7vt)<3 (9.2) 

teS 

Also, note that ajyu > t* for sufficiently large N, as can be seen from (|1.7I) . Therefore, by 
part 1 of Proposition 13.61 for sufficiently large N we have < ajsfU. Therefore, recalling 

(I3.23P and using either part 3 of Proposition 13.61 or Remark 13.41 for sufficiently large N we have 
R(oArt) = max{j : tj < aj^t} for all t G S. Suppose R{aNt) = i, so r* < a]\ft < r^+i. By part 
1 of Proposition 13.31 no type i + 2 individual can appear before time Ti+i, which implies that 
max{j : Xj{aj^t) > 0} < i + 1. By part 3 of Proposition 13.31 we have Alj_i(aArt) > 0. Therefore, 
for all t G S, 

R{aNt) — 1 < max{j : Xj{aNt) > 0} < R{a]\rt) + 1. (9.3) 

Combining (j9.2p and (19.3p gives 

sup \Q{aNt) — R{aNt)\ < 4. (9-4) 

tes 

Now suppose instead 1 < tt < u < oo. Recall from (ll.23p that j{t) = max{j : 'jj < a^t}, 
and write h{t) = max{j : tj < awt}- Then, again recalling (j3.23l) . for all t G 5 we have 


R{aNt) = h{t) — j{t). By following again the derivation of (19.3p . we get, for all t G S, 

h{t) — 1 < max{j : Xj{aNt) > 0} < h{t) + 1. (9-5) 

Furthermore, we have ONt G which means \M{aNt) — j(t)| < 2 C 5 by part 3 of 

Proposition 13.51 Therefore, 

sup|M(aArt) -j{t)\ < 2 C 5 . (9.6) 

teS 

It follows from (j9.5p and (j9.6p that 

sup \Q{a]\ft) — R{aNt)\ < 1 + 2 C 5 . (9.7) 

tes 

Because, for sufficiently large N, equation (j9.4p holds with probability at least 1 — e if 0 < 
u < V < 1 and equation (19.711 holds with probability at least 1 — eifl<M<u<oo, we may 
conclude (j9.ip . Finally, the result (|1.15p was established as part of Lemma [721 Cl 
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9.2 The speed of evolution 

Here we obtain Theorem 11.21 which gives the asymptotic behavior of the mean number of muta¬ 
tions in the population and therefore determines the speed of evolution. 


Proof of Theorem \1.2l It suffices to prove (jl.lSI) when S = [u,v\, where either 0<u<u<lor 
1 < u < u < oo. Suppose first that 0 < u < u < 1. Then m{t) = 0 for all t £ S. By part 1 of 
Proposition 13.51 for all e > 0, we have 

p( supM(aArt) < 3 ) > 1 — e 

V t&s ) 

for sufficiently large N. The result (I1.18|) follows immediately. 

Suppose instead that 1 < u < u < oo. We hx e > 0, J > 0, and T > max{l, u}. We assume for 
now that N is large enough that the conclusions of Propositions 13.51 and 13.61 hold with probability 
at least 1 — e, and we work on the event that the conclusions of these propositions hold. Recall 
that j{t) = max{j : jj < a]\ft} = max{j : Tj < aj^it — 1)}, so a^t £ [7i(t)) 7i(t)-i-i) t £ S. 

By part 1 of Proposition 13.61 we have j{t) > k* + 1 for all t € S' if is sufficiently large, so it 
follows from Proposition 13.51 that 

\M{a]yt) — j{t)\ < 2 C 5 for all t £ S. (9-8) 


Now because 1 < tt < u, for all t E S' we have 

rt-l 


m{t) = 1+1 q{u) 

Jo 


du 


= 1 + 


I 


Tfet+i/ajv 


/-Tj+i/ajv rt-l 

q{u) du + I 

I Tj I apf 


(t)/o-N 


q{u) du. 


(9.9) 


q{u) du+ j 

j=k*+l 

We now obtain upper and lower bounds on the expression in (j9.9h . For the upper bound, we use 
(j3.24p along with part 1 of Proposition 13.61 and the fact that q{t) < e for all t by Lemma 17.21 to 
get 


„,(«) < 1 + illl+l + y(4) _ r). L!7i < tfOO + 26) 


, k*\ 2e , , 

+ +!-• (9.10) 

kN J kN 


ajy k]^ k]^ 

For the lower bound, we use (|3.25p and the fact that ( 7 fc*+i/a 7 v) — 1 < 2/A;jv by part 1 of 
Proposition 13.61 to get 

j(t)(l-25) 


m{t) > 1 -t- {j{t) — k* — 1) 


1 - 25 


> 


, k* + l\ 2 


k jv ^ V 

It follows from (19.8p . (|9.10l) . and (19.1111 that there exists a positive constant C, depending on e, 
5, and T, such that 


sup 

tes 


M(a]vt) 


kN 


— m{t) 


< 


25j{t) + C 
kN 


(9.12) 


Because Proposition 13.61 implies that for all t £ S, 

3k N 


j{t) < k* -• unv < k* + 3vkN, 


UN 


and, by our assumptions, the event in (|9.12l) holds with probability at least 1 — e for sufficiently 
large N, the result (|1.18p follows. □ 
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9.3 The distribution of fitnesses in the popnlation 

In this subsection, we prove Theorem 11.41 which describes the distribution of the fitnesses of 
individuals in the population at time a^t. We begin with a lemma concerning the differences 
Tj_|_i — Tj. Recall again the definition of j{t) from (|1.23p . 

Lemma 9.1. For each rj > 0 and t G (1,2) U (2,oo), there exists 9 = 9{r],t) > 0 such that 

"^ ‘^*** ■ = ' 

and, for each fixed r] > 0 and a > 2, 

inf 9{r],t) > 0. (9.13) 

t£[a,oo) 

Proof. Lemma YT7I\ states that the function q is continuous on [0,1) U (l,oo). Also, we can see 
from (|1.13p and the fact that 1 < q{t) < e for all t > 0 that q is uniformly continuous on [l,oo). 
Therefore, we may choose 9 = 9{r],t) > 0 such that the following hold: 

1) \q{t — 1) — q{u)\ < ri/7 for all u G [t — 1 — 30, t — 1 + 30], 

2) [f-l-30,f-l + 30] C (0, l)U(l,oo), 

3) (I9.13P holds for each fixed rj > 0,a > 2. 

Fix e > 0, J G (0,7//14), and T > t. We may assume N is large enough that the conclusions 
of Proposition 13.61 hold with probability at least 1 — e. For now, we will work on the event that 
the conclusions of Proposition 13.61 hold. By (ll.23p . we have < a^t < 7 j(t)+i, and it follows 
that < aN{t — 1) < Therefore, by (|3.26l) . for all j G [j{t) — 9k]\f,j{t) + 9k]\f] H Z such 

that j > k* + 1, we have 

aN{t - 1) - {9kN + 1) • < Tj < Tj+i < aAr(f - 1) + 9kN ■ -r^- 

kN kN 

It follows that for sufficiently large N, 


a7v(t — 1 — 30) < Tj < Tj+i < a7v(i — 1 + 30). 


(9.14) 


Because [t — 1 — 30, t — 1 + 30] C (0,1) U (1, oo), we can see from part 1 of Proposition 13.61 that for 
sufficiently large N, we have j{t) — dk^ > k* + 1. Also, in view of part 1 of Proposition 13.61 for 
sufficiently large N the interval [t —1 — 30, t —1 + 30] will not intersect [1,7fc*+i/aAr]. Therefore, for 
sufficiently large N, equations (|3.24p and (|3.25p imply that for all j G [j{t) — 9kN,j(t) + 9kN]r\’Ij, 
we have 


1-26 p+i/“Jv 

—] - < / q{u) du < 

JTj/aN 


1 + 25 
kw 


Combining this result with (19.141) and condition 1) above, we get that for sufficiently large N, 


(1 — 26)a]\[ 


{q{t - 1) + r]/7)kN 


< Tj + l - Tj < 


(1 + 26)a]T 


{q{t - 1) - r]/7)kN 


for all j G [j{t) — 9ki^,j{t) + 9k!^]{^'L. Because g(rt) > 1 for all n > 0 bv Lemma r7.2l and 6 < r//14, 
we have 

(1 - 2(l)g(f - 1) ^ {l + 26)q{t-l) < ^ ^ 

3“ qit-l) + r)/7 “ g(t - 1) - r?/7 “ 3 

if rj is sufficiently small. Because e > 0 is arbitrary, the result follows. □ 
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Proof of Theorem \1.4\ Let ?? > 0 and t G (1, 2)U (2, oo). Choose 9 = 6{r], f) such that 0 < 0 < 1/4 
and the three conditions at the beginning of the proof of Lemma fO.ll are satisfied. As in the proof 
of Lemma l9.ll choose e > 0, 5 € (0, r;/14), and T > t. We may assume that N is large enough 
that the conclusions of Propositions 13.31 and 13.61 hold with probability at least 1 — e. For now we 
will suppose the conclusions of Propositions I.S..SI and 13.61 hold. 

Suppose £ is an integer with \i\ < Ok^- As noted following (19.1411 in the proof of Lemma l9.11 
the fact that [t—1—36, t—1+39] C (0,1)U(1, oo) implies, if N is large enough, that j{t)+i > k* + l. 
Furthermore, because 9 < 1/4, for sufficiently large N we have > 7i(i)+i ^ 

by (j9.14p . < ow(t — 1 + 39) < ajsft. Therefore, by (j3.20|l . 

Consider hrst the upper bound when 1 < i < 9kj^. We have 


log 


f ^j(t)+eiO'Nt) 




< log 
= log 

= log 


1 + 5 
1-5 
1 + 5 
1-5 

1 + 5 
1-5 

e 


+ 


ro-Nt 

Tai+\\i 


Gj^t)+i{v) dv - 


'rj{t)+i+i 

Tj(^t)+l+l 


ra,Nt 

•^+(0+1 


Gj{t){v) dv 


+(t)+l 

^+(t)+®+l 


Gj(t)+e{v) dv + si{aNt - Tji^t)+i) 

Gj(^t)+i{v) dv 

i=l 

i=l 




By Lemma l 6 .ll 


■^j(i)+i+i 


E 

i=l “'+(*)+» 


Gj(t)+i(u) dv < -£log 




By Lemma l9.ll with probability tending to one as —)• oo, we have 






E(« 


2 = 1 


(1 - r]/3)saN 
q{t - l)kN 
(1 rj/3)£{e - 1 ) 


- ^ 


log 


2q{t — l)k]\[ 

Also, by (II. 241) and Lemma l9.11 with probability tending to one as A —)• oo we have 

(1/2 - d{t)){l - r]/3)aN 


- ant = (1/2 - d(t))(7j(t)+i - > 

which leads to 


q{t - l)kN 


(9.15) 


(9.16) 


(9.17) 


si{a]\ft 


Tjit) + l) 


<«og(i) 


£(l/2-d(t))(l-r?/3) 
q{t - l)kN 



(9.18) 
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Combining (19.151) . (19.161) . (19.171) . and (|9.18p and using that 




^,1 \ 


(9.19) 


we get 


log 


Xj(t)+eiaNt) \ 


< log 


l + <5 
1-5 


+ 




The argument for the lower bound follows the same steps. Prom Lemma EH we get 2 in place 
of 1/Cq in (|9.16p . and the result becomes 


f Xj^t)+eiaNt) 


. (1 + ^/3) f f-2d{t)l 

1 + 5) qit-l)kN[ 2 


log(-l. 


(9.21) 


Since |d(t)| < 1/2, we have — 2d{t)i)/2 < + i')l2 < (?. Since q{t — 1) > 1 by Lemma 171^ 

and log{s/^)/kN —>■ 00 as —>■ 00 by (11.71) . it follows that when (|9.20l) and (|9.21l) hold and N is 
sufficiently large, we have 


1 f ^j{t)+eiaNt)\ f- 2 d{t)£ ( 

V Xj(t){aNt) ) 2 q{t - l)kN V 


< 


77^^ log(s/7r) 
kN 


(9.22) 


Suppose now that —9ki\f < i < —1. The proof is similar to the case in which i is positive. 
For an upper bound, note that 

(^=) ^ +£1., - £1 


< log 
= log 
= log 


1-5 
1 + 5 
1-5 

1 + 5 
1-5 
0 


+ 


/ 


+(t)+«+i 
+ (*)+! 


Gj(t)+e{v) dv + s£{aNt - Tj(t)+i) 


LW+^+i 
0 


+ E 


+ (7) + '‘+l 


Gj{t)+ii'>j) dv 


i=l+l +(7)+» 

+ ^ ~ i){'^j{t)+i+i ~ Tj{t)+i) + si{aNt — rj(4)_,_i). 


i=l+l 


Using Lemma EH again. 


(9.23) 


" /-LCO+i+i /' 2 s 

^ / Gj(^t)+iiv) dv <-ilogl — 

i=l+l d'rj(t)+i V + 

By Lemma l9.ll with probability tending to one as iV —?> 00 , we have 
E S(£ - i){T^^t)+i+l - U(7)+i) < ^ E 


i=i+l 


i=i+l 


(1-W3)^(^-1) 


2 q{t — l)k]sf 




(9.24) 


(9.25) 
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Repeating the reasoning that leads to (I9.18P gives 


f,, . ^ (^\ ^(1/2 - d(i))(l+ ^/3) , (s\ 

-( 9 - 26 ) 

Combining (|9.23p , (I9.24p , (|9.25p , and (j9.26l) , and then using (I9.19P again along with the inequality 
- l)/2 - £(1/2 - d{t)) < £2/2 - 3£/2 < 2£2, we get 


log 




< log 


1 + h 

1-5 


£log 2 


£2 — 2(i(£)£ /s\ 2 r 7£2 log(s//i) 

2g(t — l)/cAr 3q{t — l)kN 


By following the same steps, we obtain the analogous lower bound 


/ Xj(t)+^(ajvt) \ 



+ £ log Ce 


£2 - 2 d{t)i /s\ 

2g(t - l)A:Ar V/f^y 


2 r]£‘^ log{s/fj,) 
3q{t — l)kN 


Since q{t — 1) > 1 by Lemma [7.21 and log{s/fj,)/kN —)• oo as —>■ oo by (jl.7p . it follows from 
these upper and lower bounds that ()9.22p holds for sufficiently large N in this case as well. 

Since (I9.22h is trivial when £ = 0, equation (I9.22h holds for all £ € [6k]\f,6k]\f] n Z with 
probability at least 1 — e, if is large enough. Recalling (II.lip , since e > 0 was arbitrary, 
Theorem 11.41 follows. □ 
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